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OM nucleic - nucleic search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched : 



January 23, 2004, 02:14:00 ; Search time 7239.48 Seconds 

(without alignments) 
12559.330 Million cell updates/sec 

US-09-830-972-1 
3741 

1 attgctcgtctgggcggcgg gattgaagcgcaaagcagat 3 741 

I DENTI TY_NUC 

Gapop 10.0 , Gapext 1.0 



22781392 seqs, 12152238056 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -process ing : Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



45562784 



Database 



EST: * 



1: 


em_estba : * 




2 : 


em esthum:* 




3 : 


em_estin : * 




4 : 


em_estmu: * 




5 : 


em_estov : * 




6: 


em_estpl : * 




7 : 

8 : 


em estro : * 
em_htc : * 




9: 


gb_estl : * 




10 


gb_est2 : * 




11 


gb_htc : * 




12 


gb est3:* 




13 


gb_est4 : * 




14 


gb__est5 : * 




15 


em estfun:* 




16 


em_estom: * 




17 


em gss hum: 




18 


em gss inv: 




19 


em_gss_pln: 


* 


20 


em gss vrt : 


* 


21 


em gss fun: 


* 


22 


em_gss_mam: 


* 


23 


em gss mus : 




24 


em_gss_pro : 




25 


em_gss rod: 


* 


26 


em_gss_phg : 




27 


em_gss_vrl : 





28: gb_gssl:* 
29: gb__gss2:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

o. 
o 

Result Query 

No. Score Match Length DB ID Description 



1 


772.4 


20. 


.6 


969 


13 


BU839934 


BU839934 


AGENCOURT 


2 


767 .2 


20. 


. 5 


785 


14 


CA511870 


CA511870 


UI-R-FJ0- 


3 


753 .4 


20. 


. 1 


842 


13 


BU709149 


BU709149 


UI-M-EW0- 


4 


745 


19. 


. 9 


896 


14 


CB204418 


CB204418 


AGENCOURT 


5 


725.6 


19. 


.4 


796 


14 


CA504729 


CA504729 


UI-R-FJ0- 


6 


709.8 


19. 


. 0 


805 


12 


BI730192 


BI730192 


603349739 


7 


709.4 


19. 


. 0 


822 


14 


CB521332 


CB521332 


UI-M-GH0- 


8 


707 . 8 


18 . 


. 9 


986 


13 


BU841009 


BU841009 


AGENCOURT 


9 


684.6 


18. 


.3 


919 


13 


BU590898 


BU590898 


AGENCOURT 


10 


673 .6 


18 . 


. 0 


778 


13 


BU709106 


BU709106 


UI-M-EW0- 


11 


662.6 


17. 


. 7 


777 


14 


CA320618 


CA320618 


UI-M-FW0- 


12 


660 .4 


17. 


. 7 


802 


14 


CA320635 


CA320635 


UI-M-FW0- 


13 


659 .2 


17. 


.6 


951 


13 


BQ892001 


BQ892001 


AGENCOURT 


14 


648 .6 


17. 


.3- 


739 


13 


BU612951 


BU612951 


UI-M-FR0- 


15 


638 .4 


17 . 


. 1 


742 


14 


CA320833 


CA320833 


UI-M-FW0- 


16 


638 


17 . 


. 1 


638 


14 


CB576696 


CB576696 


AMGNNUC : C 


17 


633.8 


16. 


. 9 


779 


14 


CB244702 


CB244702 


UI-M-FY0- 


18 


633 .4 


16. 


. 9 


862 


9 


AU079375 


AU079375 AU079375 


19 


616 


16. 


.5 


700 


12 


BI664179 


BI664179 


603289106 


20 


608 .2 


16. 


.3 


935 


13 


BQ963057 


BQ963057 


AGENCOURT 


21 


606.8 


16. 


.2 


751 


14 


CA315995 


CA315995 


UI-M-FW0- 


22 


605 


16. 


,2 


673 


14 


CD349457 


CD349457 


UI-M-FY0- 


23 


604.8 


16. 


.2 


623 


14 


CB578453 


CB578453 


AMGNNUC : C 


24 


604 .8 


16. 


.2 


691 


13 


BU707644 


BU707644 


UI-M-FR0- 


25 


598 


16. 


. 0 


609 


14 


CB580803 


CB580803 


AMGNNUC : N 


26 


589.2 


15. 


. 7 


914 


9 


AU079162 


AU079162 AU079162 


27 


574 .2 


15. 


.3 


782 


12 


BI739239 


BI739239 


603359521 


28 


570 


15. 


.2 


810 


10 


BG668013 


BG668013 


DRABTB12 


29 


568 . 6 


15. 


.2 


698 


13 


BU058441 


BU058441 


UI-M-FO0- 


30 


561 


15. 


.0 


624 


14 


CB578355 


CB578355 


AMGNNUC : N 


31 


561 


15. 


.0 


646 


9 


AA791734 


AA791734 vu08b07 . r 


32 


561 


15. 


. 0 


964 


13 


BQ900768 


BQ900768 


AGENCOURT 


33 


560 


15. 


. 0 


3533 


11 


AK034902 


AK034902 


Mus muscu 


34 


559.8 


15 . 


, 0 


567 


12 


BI289826 


BI289826 


UI-R-DK0- 


35 


556.8 


14 . 


. 9 


717 


14 


BY756291 


BY756291 


BY756291 


36 


556 


14 . 


. 9 


556 


14 


CB613337 


CB613337 


AMGNNUC : N 


37 


550.8 


14 . 


. 7 


588 


10 


BF563033 


BF563033 


UI-R-BOl- 


38 


547.4 


14 . 


.6 


691 


14 


CB525239 


CB525239 


UI-M-FY0- 


39 


546 . 8 


14 . 


.6 


817 


14 


CA322433 


CA322433 


UI-M-FX0- 


40 


545.8 


14 . 


6 


630 


13 


BQ769602 


BQ769602 


UI-M-FI0- 


41 


543 .4 


14 . 


5 


781 


14 


CA322160 


CA322160 


UI-M-FX0- 


42 


542 


14 . 


5 


548 


10 


BF562389 


BF562389 


UI-R-BU0- 


43 


541.4 


14 . 


5 


686 


10 


BB307864 


BB307864 


BB307864 


44 


537.4 


14 . 


4 


679 


12 


BI149602 


BI149602 


602848410 


45 


537.4 


14. 


4 


708 


12 


BI157842 


BI157842 


602923001 



ALIGNMENTS 



RESULT 1 
BU839934 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



BU839934 969 bp mRNA linear EST 16-OCT-2002 

AGENCOURT_894 7611 NIH_MGC_130 Mus musculus CDNA clone IMAGE : 632 98 90 
5*, mRNA sequence. 
BU839934 

BU839934 .1 GI : 24024317 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (bases 1 to 969) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Mark Maconochie, Ph.D. and Nancy L. Freeman, 
Ph.D. 

cDNA Library Preparation: ResGen, Invitrogen Corp 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Agencourt Bioscience Corporation 

Clone distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consort ium/LLNL at: 

http : // image . llnl . gov 

Plate: LLAM13783 row: g column: 11 

High quality sequence stop: 651. 
Location/Qualifiers 
1. .969 

/organism="Mus musculus" 
/mol_type=''mRNA" 
/db_xref="taxon: 10090" 
/clones "IMAGE: 6329890" 
/lab_host="DH10B (phage-resistant ) " 
/ cl one_l ib= " NI H_MGC_13 0 " 

/note=" Organ: otocysts ; Vector: pCMV-SPORT6 . 1 . ccdb; 
Site_l: EcoRV; Site_2 : NotI ; Cloned unidirectionally . 
Primer: Oligo dT. Average insert size 1.95 kb. 
Constructed by ResGen, Invitrogen Corp. Note: this is a 
NIH_MGC Library. " 
325 a 194 c 192 g 256 t 2 others 



Query Match 20.6%; Score 772.4; DB 13; Length 969; 

Best Local Similarity 89.6%; Pred. No. 5.8e-112; 

Matches 878; Conservative 0; Mismatches 88; Indels 14; 



Gaps 



4; 



Qy 2172 CATGAATGTAG CACTAAAAG CTTTGGGAACAAAGGAAGGAATAAAAGAG CCTGAAAGTTT 2231 

Mill Mill:!:! I I II MINIMI III I M I II 1 1 1 1 1 1 II 1 1 i 

Db 1 CATGAGTGTAGCACTAAAAACATCGGACTCAAAGGAAGAAATTAAAGAGCCTGAAAGTTT 60 



Qy 


2232 


TAATG CAG CTGTTCAGGAAACAGAAG CT CCTTATATAT CCATTGCGTGTGATTTAATTAA 

MINIM MM MMIMMIMM MIMI M Illlllllllllll 

TAATG CAG CTG CT CAGGAAG CAGAAG CT C CTTATATAT CCATTG CATGTGATTTAATTAA 


2291 


Db 


61 


120 


Qy 


2292 


AGAAACAAAGCTCTCC^CTGAGCCAAGTCC^GATTTCTCTAATTATT 

IIIIMIIIIIIIIIIIIIIIIIIIIIIIIIII IIIIIIIIIIIIIIIIIIIIIIIMI 

AGAAACAAAGCTCTCCACTGAGCCAAGTCC^GAGTTCTCTAATTATTC^ 


2351 


Db 


121 


180 


Qy 


2352 


ATTCGAGAAGTCGGTG CCCGAACACG CTGAG CTAGTGGAGGATTCCTCAC CTGAAT CTGA 

III IIIIIIIMIIMI II III MINI Mill 1 M MM llllllll 

ATTTGAGAAGTCGGTGCCTGATCACTGTGAGCTCGTGGATGATTCCTCACCCGAATCTGA 


2411 


Db 


181 


240 


Qy 


2412 


ACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAAGTCCCACAAACACAAGAGGAGGC 

IIIMIIilllllllllllllllllll MUM MIMI M Mill! MIMM 

AC(^GTTGACTTATTTAGTGATGATTC^TTCCTGAAGTCCC^CAAAC^CAAGAGGAGGC 


2471 


Db 


241 


300 


Qy 


2472 


TGTGATGCTCATGAAGGAGAGTCTCACTGAAGTGTCTGAGACAGTAGC^ 

M MM MIMM IIIIIIIIMIIMM MIMI 1 1 II MIMI 

TGTGATGCTAATGAAGGAGAGTCTCACTGAAGTGTCTGAGACA.GTAACACAACACAAACA 


2529 


Db 


301 


360 


Qy 


2530 


-GAGGAGAGACTTAGTGCCTCACCTCAGGAGCTAGGAAAGCCATATTTAGAGTCTTTTCA 

IIIIIIIIMIIMM MIIIIMM II MUM II IMMMMMM 

TAAGGAGAGACTTAGTGCTTCACCTCAGGAGGTAGGAAAGCCATATTTAGAGTCTTTTCA 


2588 


Db 


361 


420 


Qy 


2589 


GCCCAATTTACATAGTACTWUVGATGCTGCATCTi^ 

IMIMIIIIIIII IIIIMIIIIIIIIIIIIMIMI IIIIIIMIMIIIIIMII 

GCCCAATTTACATATTACAAAAGATGCTGCATCTAATC 


2648 


Db 


421 


480 


Qy 


2649 


GGAGAAAATTTCTTTGCAAATGGAAGAGTTTAATACTGCAATTTATTCAAATGATGACTT 

Mill IIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIIIIIIIIII MIIMIMM 

GGAGACAATTTCTTTGCAAATGGAAGAGTTTAATACTGCAATTTATTCCAATGATGACTT 


2708 


Db 


481 


540 


Qy 


2709 


ACTTTCTTCTAAGGAAGACAAAATAAAAGAAAGTGAAACATTTTC^GATTCATCTCCGAT 

1 1 1 II I 1 f 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 f 1 1 ! 1 1 1 1 1 1 1 1 lllllilllll II 

1 1 1 1 1! II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 lllllilllll II 

ACTTTCTTCTAAGGAAGACAAAATGAAAGAAAGTGAAA(^TTTTCCGATTCATCTCCCAT 


2768 


Db 


541 


600 


Qy 


2769 


TGAGATAATAGATGAATTTCCCACGTTTGTCAGTG CTAAAGATGATTCT C CTAAATTAG C 

Mlllllllllllll llllllll MIIMIMM MIIMIMM 

TGAGATAATAGATGAGTTTCCCACATTTGT CAGTG CTANAGATGATTCT CCT 


2828 


Db 


601 


652 


Qy 


2829 


(^GGAGTACACTGATCTAGAAGTATCCGACAAAAGTGAAATTGCTAATATCCAAAGCGG 


2888 


Db 


653 


Illlllllllllll llllllllllll lillllllllllllllllll MM Mill 

-AAGGAGTACACTGACCTAGAAGTATCCAAOW^GTGAAATTGCTAATGTCCAGAGCGG 


711 


Qy 


2889 


GGCAGATTCATTGCCTTGCTTAGAATTGCCCTGTGACCTTTCTTTCAAGAATATATATCC 

II MM MIIIIMM MIMIMMIMMIIMMMMMIMM MUM 

NGCCAATTCGTTGCCTTGCTCAGAATTGCCCTGTGACCTTTCTTTCAAGAATACATATCC 


2948 


Db 


712 


771 


Qy 


2949 


TAAAGATGAAGTACATGTTTCAGATGAATTCTCCGAAAATAGGTCCAGTGTATCTAAGGC 

lllllilllll MIMI IMM Mill 1 Ml MIMIMIMII Mill 

TAAAGATGAAGCACATGTCTCAGATGAATTCT - CAAAAGTAGGTCCAGTGTATCTAAGGT 


3008 


Db 


772 


830 


Qy 


3009 


ATCCATATCGCCTTt^W^TGTCTCTGCTTTGGAACCTCAGAC^GAAATGGGCAGCATAGT 

II III II 1 1 MM 1 MIMI MM 1 II MIMM MIMI 

GCCCCTATTGCTTCCCAATGGTTTCTGCTTGGAATCTCAAATAG-AATGGGCCACATAGT 


3068 


Db 


831 


889 


Qy 


3069 


TAAATCC^ATCACTTACGAAAGAAGCAGAGAAAAAACTTCCTTCTGA(^CAGAGAAAGA 


3128 



I II Mill llllll 1 1 1 1 1 M 1 1 II llllllllllllll I llllllli 

Db 8 90 TTAACCCAAAGTACTTACGGAAGAAGCAGAGGAAAAACTTCCTTCTTGATCCGAGAAAGA 94 9 



Qy 312 9 GGACAGATCCCTGTCAGCTG 314 8 

II llllllli llllll 
Db 950 GGGACGATCCCTGACAGCTG 96 9 



RESULT 2 
CA511870 

LOCUS CA511870 785 bp mRNA linear EST 15-NOV-2002 

DEFINITION UI-R-FJ0-cpx-e-15-0-UI . rl UI-R-FJO Rattus norvegicus cDNA clone 

UI-R-FJ0-cpx-e-15-0-UI 5*, mRNA sequence. 
ACCESSION CA511870 

VERSION CA511870.1 GI:25002824 

KEYWORDS EST . 

SOURCE Rattus norvegicus (Norway rat) 

ORGANISM Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; 
Rattus . 

REFERENCE 1 (bases 1 to 785) 

AUTHORS Bonaldo,M.F. , Lennon,G. and Soares ; M.B. 

TITLE Normalization and subtraction: two approaches to facilitate gene 
discovery 

JOURNAL Genome Res. 6 (9), 791-806 (1996) 
MEDLINE 97044477 
PUBMED 8 88 9548 
COMMENT Contact: Soares, MB 

Coordinated Laboratory for Computational Genomics 
University of Iowa 

375 Newton Road , 4156 MEBRF , Iowa City, IA 52242, USA 

Tel: 319 335 8250 

Fax: 319 335 9565 

Email: bento-soares@uiowa.edu 

Tissue Procurement: Dr. James Lin, Universtiy of Iowa 
cDNA Library preparation: Dr. M. Bento Soares, University of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa 
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa 
Clone Distribution: Researchers may obtain clones from Research 
Genetics (www.resgen.com). 
Seq primer: M13 REVERSE. 
FEATURES Location/Qualifiers 
source 1. .785 

/organism=" Rattus norvegicus" 

/mol_type= " mRNA " 

/ s t ra in^ " Sprague - Da wl ey " 

/db_xref ="taxon: 10116" 

/clone- "UI-R-FJO -cpx-e-15-0-UI » 

/ 1 i ssue_type= " embryo " 

/dev_s tage= " embryo " 

/lab_host="DH10B (Life Technologies) (Tl phage resistant)" 
/clone_l ib= "UI -R-FJO " 

/note= n Vector : pYX-Asc; SiteJL: EcoR I; Site_2: Not I; 
UI-R-FJO is a cDNA library containing the following 
tissue(s): rat embryo. The library was constructed 
according to Bonaldo, Lennon and Soares, Genome Research, 



BASE COUNT 
ORIGIN 



6:791-806, 1996. First strand cDNA synthesis was primed 
with an oligo-dT primer containing a Not I site. Double 
stranded cDNA was ligated to an EcoR I adaptor, digested 
with Not I, and cloned directionally into pT7T3-Pac 
vector. The oligonucleotide used to prime the synthesis of 
first-strand cDNA contains a library tag sequence that is 
located between the Not I site and the (dT) 18 tail. The 
sequence tag for this library is CATCTCTACT. This library 
was created for the University of Iowa Program for Rat 
Gene Discovery and Mapping (Val Sheffield, Bento Soares 
and Tom Casavant) " 
251 a 174 c 165 g 193 t 2 others 



Query Match 20.5%; Score 767.2; DB 14; Length 78 5; 

Best Local Similarity 99.2%; Pred. No. 3.9e-lll; 

Matches 78 0; Conservative 0; Mismatches 5; Indels 1; Gaps 



1; 



Qy 16 9 9 ATAACAGAGAAGACTAG CCCCAAAACGT CAAATC CTTTCCTTGTAG CAGTACAGGATT CT 1758 

Mill IIMII Mill llllll II Ml IIMIIIII II III II 

Db 1 ATAACAGAGAAGACTATCCCCACAACGT CAAATC CTTTCCTTGTAG CAGTACAGGATTCT 60 



Qy 175 9 GAGGCAGATTATGTTACAACAGATACCTTATCAAAGGTGACTGAGGCAGCAGTGTCAAAC 1818 

Mill MIMMII IIMII llllll II Ml IIMIIIII II III II 

Db 61 GAGG CAGATTATGTTACAACAGATACCTTATCAAAGGTGACTGAGG CAG CAGTGT CAAAC 120 

Qy 1819 ATG CCTGAAGGT CTGACGCCAGATTTAGTT CAGGAAG CATGTGAAAGTGAACTGAATGAA 1878 

IMIM MMIIMI IMIM MMM II llllll MIMMII llllll 

Db 121 ATG CCTGAAGGTCTGACG CCAGATTTAGTTCAGGAAG CATGTGAAAGTGAACTGAATGAA 180 

Qy 1879 GCCAt^GGTACAAAGATTGCTTATGAAACAAAAGTGGACTTGGTCCAAA^ 193 8 

Mill IIMIIIII IMIM MIMIMIMIMII MIMIMMMMIM 

Db 181 GCCACAGGTACAAAGATTGCTTATGAAACAAAAGTC 24 0 

Qy 193 9 ATACAAGAATCACTTTACCCCACAGCACAGCTTTGCCC^ 1998 

Mill IIMIIIII IMIM MIMMII IIMII MIMIIM MMM 

Db 241 ATACAAGAATC^CTTTACCCC^CAGCACAGCTTTGCCCATCATTTGAGGAAGCTGAAGC^ 3 00 

Qy 1999 ACTCCGTCACCAGTTTTGCCTGATATTGTTATGGAAGCACCATTAAATTCTCTCCTTCCA 2 058 

IMM MMIMMMIMM MIMMII IIMII MIMIIM MMM 

Db 3 01 ACTCCGTC^CCAGTTTTGCCTGATATTGTTATGGAAGCACCATTAAATTCTCTCCTTCCA 3 60 

Qy 2 05 9 AGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTATCCCCACTGGAAGCACCTCCTCCAGTT 2118 

MIMI IMMMMMMIM MIM, II llllll MIMMII llllll 

Db 361 AGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTATCCCCACTGGAAGCACCTCCTCCAGTT 420 

Qy 2119 AGTTATGACAGTATAAAGCTTGAG C CTGAAAACCCCC CACCATATGAAGAAG CCATGAAT 2178 

Mill MIMIIM IIMII Mill 1 1 Mill III I II Mill MIMMII 

Db 421 AGTTATGACAGTATAAAGCTTGAGCCTGAAAATCCCCCACCATATGAAGAAGCCATGAAT 480 

Qy 2179 GTAGCACTAAAAG CTTTGGGAACAAAGGAAGGAATAAAAGAGCCTGAAAGTTTTAATG CA 2238 

IMM IIMII II IMIM llllll IIMII II llllll IMIM || 

Db 481 GTAG CACTAAAAGCTTTGGGAACAAAGGAAGGAATAAAAGAGCCTGAAAGTTTTAATG CA 54 0 

Qy 223 9 GCTGTTCAGGAAAC^GAAGCTCCTTATATATCCATTGCGTGTGATTTAATTAAAGAAACA 22 98 

MMM MMM II MIMMMMM IIIIIIMI llllll MIMMII 

Db 541 GCTGTTCAGGAAACAGAAGCTCCTTATATATCCATTGCGTGTGATTTAATTAAAGAAACA 600 



Qy 22 99 AAGCTCTCCACTGAGCCAAGTCCAGATTTCTCTAATTATTCAGAAATAGCAAAATTCGAG 2358 

IIIIIIIIIIIIIIIIIMIIIIIMIIIIIIIIIIIIIIIMIIIIIIII MINIM 

Db 6 01 AAG CT CT C CACTGAG CCAAGTCCAGATTTCTCTAATTATTCAGAAATAG CANAATTCGAG 660 

Qy 2359 AAGTCGGTGCCCGAACACGCTGAGCTAGTGGAGGATTCCTCACCTGAATCTGAACCAGTT 2418 

I ! I- III 1 1 1 II M 1 1 III I MM! 1 1 II I i I III 1 1 1 II Ml I II 1 1 1 M Ml II 

Db 661 AAGTCGGTGCCCGAACACGCTGAGCTAGTGGAGGATTCCTCACCTGAATCTGAACCAGTT 72 0 

Qy 2419 GACTTATTTAGTGATGATTCGATTCCTGAAGTCCCACAAACACAAGAGGAGGCTGTGATG 24 78 

III MIMIIIIMIIIII llllll M Mill Mill! M II 1 1 1 1 1 1 1 1 

Db 721 GACTTATTTAGTGATGATTCGATTCCTGAAGT - CCACANACACAAGAGGAGGCTGTGATG 779 

Qy 2479 CTCATG 2484 

llllll 

Db 780 CTCATG 785 



RESULT 3 
BU709149 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS v 
TITLE 
JOURNAL 

COMMENT 



EST 26-NOV-2002 



FEATURES 

source 



842 bp mRNA linear 
-0-UI.rl NIH_BMAP_EW0 Mus musculus cDNA clone 
mRNA sequence. 



Craniata; Vertebrata ; Euteleostomi ; 
Sciurognathi ; Muridae; Murinae; Mus. 



BU709149 

UI-M-EW0-caz-o-10- 
IMAGE: 6419553 5 ' , 
BU709149 

BU70914 9. 1 GI : 23 642332 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Rodent ia; 
1 (bases 1 to 842) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 
Email : cgapbs-r@mail .nih.gov 

Tissue Procurement: Dr. James Lin, Univeristy of Iowa 
cDNA Library preparation: Dr. M. Bento Soares , University of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa 
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa 
Clone Distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consort ium/LLNL at: 

http : //image . llnl . gov 
This clone was contributed by the Brain Molecular Anatomy Project 

(BMAP) 

Seq primer: pYX-5. 

Location/Qualifiers 
1. .842 

/organism="Mus musculus" 
/mol_type=" mRNA" 
/strain="C57BL/6" 
/db_xref ="taxon: 10090" 
/clone=" IMAGE: 6419553" 
/tissue_type="whole brain" 
/dev_stage=" embryo 15.5 dpc" 
/lab_host="DH10B (Tl phage resistant) " 
/clone lib="NIH BMAP EWO " 



/note="Organ: brain; Vector: pYX-Asc; Site_l: EcoR I; 
Site_2: Not I; The library was constructed according to 
Bonaldo, Lennon and Soares, Genome Research, 6:791-806, 
1996. Denatured mRNa was size fractionated on a 1% agarose 
gel . First strand cDNA synthesis was primed with an 
oligo-dT primer containing a Not I site. Double stranded 
cDNA was size selected according to mRNA size fraction, 
ligated with EcoR I adaptor, digested with Not I, and then 
cloned direct ionally into pYX-Asc vector. The library tag 
sequence located between the Not I site and the polyA tail 
, is GTGCGTGGAA. This library was created for the 
University of Iowa Mouse Brain Molecular Anatomy Project 
(BMAP) : 'Gene Discovery in the Developing Mouse Nervous 
System', supported by National Instututes of Mental Health 
(NIMH), Hemin Chin, Ph.D., program coordinator." 

BASE COUNT 275 a 182 c 181 g 202 t 2 others 

ORIGIN 

Query Match 20.1%; Score 753.4; DB 13; Length 842; 

Best Local Similarity 94.2%; Pred. No. 5.8e-109; 

Matches 792; Conservative 0; Mismatches 48; , Indels 1; Gaps 1; 
Qy 1677 AGAAGAAAGGAAGGCCCAAATTATAACAGAGAAGACTAGCCCCAAAACGTCAAATCCTTT 173 6 

M I! llllll MIMI MMM M MMM IMMIMI MUM II M 

Db 2 AGAAGAAAGGAAGGCCCAAATTATAACAGAGAAGACTAGCCCC^^ 61 

Qy 173 7 CCTTGTAGC^GTACAGGATTCTGAGGOVGATTATC^ 1796 

II II II II 1 1 III IIIIIU: Mil MINIUM MM II ) 

Db 62 CCTTGTAGCAATACATGATTCTGAGGCAGATTATGTCA 121 

Qy 1797 GACTGAGGCAGCAGTGTCAAACATGCCTGAAGGTCTGA 1856 

II M III MM III IIIIIMIMIIMI IMIMM llllll II M 

Db 122 GACTGAGGCAGTAGTGGC^CCATGCCTGAAGGTCTAACGCCAGATTTAGTTCAGGAAGC 181 

Qy 1857 ATGTGAAAGTGAACTGAATGAAGCCACAGGTACAAAGATTGCTTATGAAACAAAAGTGGA 1916 

II I! MIIMMII I MUM II MIMI MIMIMMMMM II I; 

Db 182 ATGTGAAAGTGAACTGAACGAAGCCAQVGGTACAAAGATTGCT 241 

Qy 1917 CTTGGTCCAAACATCAGAAGCTATACAAGAATCACT^ 1976 

1 1 1 1 1 1 1 1 i IIMIIIIMIIIIMIIII III 

Db 242 CTTGGTCCAGACATCAGAAGCTATACAAGAGTC^ 3 01 

Qy 1977 ATCATTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATGGAAGC 2 03 6 

M 

Db 3 02 ATCATTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATGGAAGC 3 61 

Qy 2 037 ACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTATCCCC 2 096 

MIMIII M llllll Ml MMMMMIMM MIMMMII llllll 

Db 362 GCCATTAAATTCTCTCCTTCCAAGCACTGGTGCTTCTGTAGCGCAGCCCAGTGCATCCCC 421 

Qy 2 097 ACTGGAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCTTGAGCCTGAAAACCCCCC 2156 

III MM Ml MMMMMIMM II II 1 1 1 II 1 1 M 1 1 1 1 M II I Mill 

Db 422 ACTAGAAGTACCGTCTCCAGTTAGTTATGACGGTATAAAGCTTGAGCCTGAAAATCCCCC 481 

Qy 2157 ACCATATGAAGAAGCCATGAATGTAGCACTAAAAGCTTTGGGAAO\AAGGAAGGAATA^ 2216 

Mill IIIMIIII IIIM IIMIMIIIIII I I II lllllllll III II 

Db 4 82 ACCATATGAAGAAGCCATGAGTGTAGCACTAAAAACATCGGACTCAAAGGAAGAAATTAA 541 



Qy 2217 AGAGCCTGAAAGTTTTAATGCAGCTGTTCAGGAAACAGAAGCTCCTTATATATCCATTGC 2276 

,1111111 I III Ml M I MM II I M M 

Db 54 2 AGAGCCTGAAAGTTTTAATGCAGCTGCTCAGGAAGCAGAAGCTCCTTATATATCCATTGC 601 

Qy 2277 GTGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGCCAAGTCCAGATTTCTCTAATTA 233 6 

I I I M II I I ; I II ! i M I I II MM 

Db 602 ATGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGCCAAGTCCAGAGTTCTCTAATTA 661 

Qy 2337 TTCAGAAATAGCAAAATTCGAGAAGTCGGTGCCCGAACACGCTGAGCTAGTGGAGGATTC 2396 

Mill 1 1 1 1 ; I II II Ml Ml Mill Mill 

Db 662 TTCAGAAATAGCANAATTTGAGAAGTCGGTGCCTGATCACTGTGAGCTCGTGGATGATTC 721 

Qy 23 97 CTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAAGTCCCACA 2456 

llllll II MINI IIIIM IIIIMII Mill lllllllllll Mill 

Db 722 CTCACCCGAATCTGAACCAGTTGACTTATTTAGTGATGATTCAATTCCTGAAGT - CCACA 780 

Qy 24 57 AACACAAGAGGAGG CTGTGATGCT CATGAAGGAGAGTCTCACTGAAGTGTCTGAGACAGT 2516 

MINI II IMIII Mil! 1 1 1 1 1 : I i 1 1 i 1 1 1 I [ 1 3 1 1 1 1 1 : 1 1 1 ; 1 1 

Db 781 NACACAAGAGGAGGCTGTGATGCTAATGAAGGAGAGTCTCACTGAAGTGTCTGAGACAGT 84 0 

Qy 2517 A 2517 

I 

Db 841 A 841 



RESULT 4 
CB204418 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



Craniata; Vertebrata; Euteleostomi ; 
Sciurognathi ; Muridae; Murinae; Mus . 



CB204418 896 bp mRNA linear EST 05-FEB-2003 

AGENCOURT_11276017 NIH_MGC_135 Mus musculus cDNA clone 
IMAGE:30138586 5', mRNA sequence. 
CB204418 

CB204418.1 GI:28241848 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Rodent ia; 
1 (bases 1 to 896) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 
Email : cgapbs -r@mail .nih.gov 
Tissue Procurement: Dr. David Rowe 
cDNA Library Preparation: Invitrogen Corp 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: 
http : / / image . 1 lnl . gov 
Plate: NDAM0041 row: k column: 11 
High quality sequence stop: 686. 
Location/Qualifiers 
1. .896 

/organism="Mus musculus" 
/mol_type= "mRNA" 



/ db_xr e f = " t axon : 1 0 0 9 0 " 
/clones "IMAGE: 30138586" 
/lab_host="DH10B (phage-resistant ) " 
/clone_lib= M NIH_MGC_135" 

/no te= "Vector: pCMVSport6 . 1 ; Site_l : EcoRV; Site_2: Not I; 
Normalized full-length enriched library from pooled mouse 
embryonic limb, maxilla and mandible, day 12.5, 13.5, 14.5 
, and 15.5 (size selected for the 0.5-1 kb fragments) 
Cloned directionally , priming method: Oligo-dT. cDNA 
enrichment: >lk bp, Average insert size 1.6k bp. 
Normalization (Cot value): 7.5 kb. Priming sequence: 
5 ' GACTAGTTCTAGATCGCGAGCGGCCGCCC (T) 3 1 Tissue contributed by 
, David Rowe. Library constructed by ResGen, Invitrogen 
Corp . " 

BASE COUNT 255 a 177 c 193 g 271 t 

ORIGIN 

Query Match 19.9%; Score 745; DB 14; Length 8 96; 

Best Local Similarity 93.0%; Pred. No. 1.2e-107; 

Matches 816; Conservative 0; Mismatches 50; Indels 11; Gaps 3; 
Qy 274 5 AACATTTTCAGATTCATCTCCGATTGAGATAATAGATGAATTTCCCACGTTTGTCAGTGC 2 8 04 

MINIMI IMMIIMM MIIIMIIIIMIIII MM. IN MIMIMIM 

Db 13 AAC^TTTTCCGATTO^TCTCCCATTGAGATAATAGATGAGTTTCCCACATTTGTCAGTGC 72 

Qy 28 05 TAAAGATGATTCT C CTAAATTAG C CAAGGAGTACACTGATCTAGAAGTAT CCGACAAAAG 2 864 

IMIMMMMMII II MINI 1 1 Ml I llllll III III MM 

Db 73 TAAAGATGATTCT C CT AAGGAGTACACTGACCTAGAAGTATCCAACAAAAG 123 

Qy 2865 TGAAATTGCTAATATCCAAAGCGGGGCAGATTCATTGCCTTGCTTAGAATTGCCCTGTGA 2 924 

IMMMIIMM MM lllllll! MM IMIMIMI IMIMMMIMM 

Db 124 TGAAATTGCTAATGTCCAGAGCGGGGCCAATTCGTTGCCTTGCTCAGAATTGCCCTGTGA 183 

Qy 2 925 CCTTTCTTTCAAGAATATATATCCTAAAGATGAAGTACATGTTTCAGATGAATTCTCCGA 2 984 

N II II II lllllll IMIMIMI llllll MMIMMMIMI I 

Db 184 CCTTTCTTTCAAGAATAC^TATCCTAAAGATGAAGCACATGTCTCAGATGAATTCTCCAA 243 

Qy 2 985 AAATAGGTCCAGTGTATCTAAGGCATCCATATCGCCTTCAAATGTCTCTGCTTTGGAACC 3 044 

ii iiiiiiiiiiiiiiiiiiii ii mi ii i iiiiiii ihi i 

Db 244 AAGTAGGTCCAGTGTATCTAAGGTGCCCTTATTGCTTCCAAATGTTTCTGCTTTGGAATC 3 03 

Qy 3 04 5 T CAGACAGAAATGGGCAG CATAGTTAAATC CAAATCACTTACGAAAGAAG CAGAGAAAAA 3104 

Ml I III IIMI I MUM Mill MIMM MIIIMI MM 

Db 3 04 T CAAATAGAAATGGGCAACATAGTTAAACC CAAAGTACTTACGAAAGAAG CAGAGGAAAA 3 63 

Qy 3105 ACTTCCTTCTGACACAGAGAAAGAGGACAGATCCCTGTCAGCTGTATTGTCAGCAGAGCT 3164 

IIIIIIIIMII IIIMIIIMIIIIIMIIIIMI 1 1 i 1 1 i J I i 1 1 1 1 ; 1 1 1 1 1 1 1 1 

Db 3 64 ACTTCCTT CTGATACAGAGAAAGAGGACAGAT CC CTGACAG CTGTATTGTCAGCAGAG CT 423 

Qy 3165 GAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGT 3224 

II Mill II MUM IIMI 1 1 1 1 ! I 1 1 1 1 1 1 1 i I M 1 1 1 1 1 1 1 1 1 1 1 

Db 424 GAATAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGT 4 83 

Qy 322 5 GTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTGTAAC 32 84 

lilllllllMlllllllilllll Mill Mill IllllllllllllilllllllUI 

Db 484 GTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTGTAAC 543 



Qy 3285 GG C CTACATTG C CTTGG CC CTG CTCTCGGTGACTATCAG CTTTAGGATATATAAGGG CGT 3344 

IMIIIIIIIIIIIIIIIIIIIIIIII I I I U M I II I !U II 

Db 544 GGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGT 603 

Qy 3345 GATCCAGGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGGGATATTTAGAATCTGA 34 04 

I I I II I I ! I I: ! :l I I Ml: II 

Db 604 GATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGGCATATTTGGAATCTGA 663 

Qy 34 05 AGTTGCTATATCAGAGGAATTGGTTCAGAAATACAGTAATTCTGCTCTTGGTCATGTGAA 34 64 

ill 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 ! MM I i MIMMII IMMI Mill I 

Db 664 AGTTGCCATATCAGAGGAATTGGTTCAGAAATATAGTAATTCTGCTCTTGGTCATGTGAA 723 

Qy 34 65 CAGCACAATAAAAGAACTGAGGCGGCTTTTCTTAGTTGATGATTTAGTTGATTCCCTGAA 3524 

i i i ii - i ii i i i i i .ni i i i 

Db 724 CAGCACAATAAAAGAATTGAGGCGTCTCTTCTTAGTTGATGATTTAGTTGATTCCCTGAA 783 

Qy 3 525 GTTTGCAGTGTTGATGTGGGTGTTTACTTATGTTGGTGCC - TTGTTCAATGGTCTGACAC 3583 

I IMM MIMI M I II 1 1 1 II I Mill llllllllllll III 

Db 784 G - TTGCAGTGTTGATGTGGGTATTTACTTACGTTGGTGCCTTTGTTCAATGGTTTGACAC 842 

Qy 3 584 TACTGATTTTAGCTCTGATCTCACTCTTCAGTATTCC 362 0 

II MIMI IM I II MMM MIMI II 

Db 843 TACTGATTTTAGCCCTGATCTCACTCTTCAGTATTCC 879 



RESULT 5 

CA504729/c 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
COMMENT 



796 bp mRNA linear EST 14-NOV-2002 
0-UI.sl UI-R-FJ0 Rattus norvegicus cDNA clone 
0-UI 3', mRNA sequence. 



Chordata; Craniata; Vertebrata; Euteleostomi ; 
Rodent ia; Sciurognathi ; Muridae; Murinae; 



CA504729 

UI-R-FJ0-cpx-e-15 
UI-R-FJO-cpx-e-15 
CA504729 

CA504 729.1 GI : 24 9 95683 
EST. 

Rattus norvegicus (Norway rat) 
Rattus norvegicus 
Eukaryota; Metazoa; 
Mammal ia ; Euther ia ; 
Rattus . 

1 (bases 1 to 796) 

Bonaldo,M. F. , Lennon,G. and Soares, M.B. 

Normalization and subtraction: two approaches to facilitate gene 
discovery 

Genome Res. 6 (9), 791-806 (1996) 

97044477 

8889548 

Contact: Soares, MB 

Coordinated Laboratory for Computational Genomics 
University of Iowa 

375 Newton Road , 4156 MEBRF , Iowa City, IA 52242, USA 

Tel: 319 335 8250 

Fax: 319 335 9565 

Email : bento-soares@uiowa . edu 

Tissue Procurement: Dr. James Lin, Universtiy of Iowa 
cDNA Library preparation: Dr. M. Bento Soares, University of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa 
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa 
Clone Distribution: Researchers may obtain clones from Research 



Genetics (www.resgen.com) . 

The following repetitive elements were found in this cDNA 
sequence: 1-35, >POLY_A#Simple_repeat (matched compliment) 
Seq primer: M13 FORWARD 
POLYA=Yes . 

FEATURES Location/Qualifiers 
source 1. .7 96 

/organism="Rattus norvegicus" 

/mol_type=" mRNA " 

/ s t ra in= " Sprague - Dawl ey" 

/dbjcref = 11 1 axon: 10116" 

/clone="UI-R-FJ0-cpx-e-15-0-UI" 

/ 1 i s sue_type= " embryo " 

/ dev_s t age = " embryo " 

/lab_host="DH10B (Life Technologies) (Tl phage resistant)" 
/clone_lib= lf UI-R-FJO ,! 

/note= "Vector: pYX-Asc; Site_l : EcoR I; Site_2 : Not I; 
UI-R-FJO is a cDNA library containing the following 
tissue(s): rat embryo. The library was constructed 
according to Bonaldo, Lennon and Soares, Genome Research, 
6:791-806, 1996. First strand cDNA synthesis was primed 
with an oligo-dT primer containing a Not I site. Double 
stranded cDNA was ligated to an EcoR I adaptor, digested 
with Not I, and cloned direct ionally into pT7T3-Pac 
vector. The oligonucleotide used to prime the synthesis of 
first -strand cDNA contains a library tag sequence that is 
located between the Not I site and the (dT) 18 tail. The 
sequence tag for this library is CATCTCTACT. This library 
was created for the University of Iowa Program for Rat 
Gene Discovery and Mapping (Val Sheffield, Bento Soares 
and Tom Ca savant) 
TAG_LIB=UI -R-FJO 
TAG_T I S SUE = r a t - embryo 
TAG_SEQ= CATCTCTACT" 

BASE COUNT 179 a 184 c 134 g 297 t 2 others 

ORIGIN 



Query Match 19.4%; Score 725.6; DB 14; Length 796; 

Best Local Similarity 99.3%; Pred . No. 1.4e-104; 

Matches 728; Conservative 0; Mismatches 5; Indels 0; Gaps 0; 



Qy 


952 


TTTAAAGAA(^TGGATACCTTGGTAACTTAT(^GCAGTGTCATCCTCAGAAGGAAa^TT 

1 1 1 1 1 II 1 1 1 1 ! II 1 M I 1 1 II 1 M 1 

TCTAAAGAACATGGATACCTTGGTAACTTATCAGCAGTGTCATCCTCAGAAGGAACAATT 


1011 


Db 


733 


674 


Qy 


1012 


GAAGAAACTTTAAATGAAGCTTCTAAAGAGTTGCCAGAGAGGGCAACAAATCCATTTGTA 

MINIMI lllllll Mill MMII Mill llllll IMIIIMM Ml lllllll 

GAAGAAACTNTAAATGAAGCTTCTAAAGAGTTGCCAGAGAGGGCAACAAATCCATTTGTA 


1071 


Db 


673 


614 


Qy 


1072 


AATAGAGATTTAGCAGAATTTTCAGAATTAGAATATTCAGAAATGGGATCATCTTTTAAA 

MMIMMIIIIMMMIIMMIIIIIIIIIIIIIMMIIIIIIIIIIIIIIIIII 

AATAGAGATTTAGCAGAATTTTCAGAATTAGAATATTCAGAAATGGGATCATCTTTTAAA 


1131 


Db 


613 


554 


Qy 


1132 


GGCT C CCCAAAAGGAGAGTCAG C CATATTAGTAGAAAACACTAAGGAAGAAGTAATTGTG 

MINI II llllil llllll llilll II llllll IMIIIIinillll 

GG CTCCCCAAAAGGAGAGTCAG C CATATTAGTAGAAAACACTAAGGAAGAAGTAATTGTG 


1191 


Db 


553 


494 



Qy 1192 AGGAGTAAAGACAAAGAGGATTTAGTTTGTAGTGCAGCCCTTCAC^GTCCAC^AGAATCA 1251 

I I II I I I I I I I I I I I II I I I M I I I I I I II I I II I I I I I I I I I I I II I I I I I I I I I I I I I 

Db 4 93 AGGAGTAAAGACAAAGAGGATTTAGTTTGTAGTG CAG CCCTTCACAGTC CACAAGAATCA 434 

Qy 1252 CCTGTGGGTAAAGAAGACAGAGTTGTGTCTCCAGAAAAGACAATGGACATTTTTAATGAA 1311 

IIIIMMMIIMMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIMI 

Db 433 CCTGTGGGTAAAGAAGACAGAGTTGTGTCTCCAGAAAAGACAATGGACATTTTTAATGAA 374 

Qy 1312 ATGCAGATGTCAGTAGTAGCACCTGTGAGGGAAGAGTATGCAGACTTTAAGCCATTTGAA 1371 

Jlllll IIIMIIII llllll llllll MINIM! IIIIIM! IIIMI 

Db 3 73 ATGCAGATGTCAGTAGTAGCACCTGTGAGGGAAGAGTATGCAGACTTTAAGCCATTTGAA 314 

Qy 13 72 CAAGCATGGGAAGTGAAAGATACTTATGAGGGAAGTAGGGATGTGCTGG CTG CTAGAG CT 1431 

MINN IIIMIII! IMMIMIIIII llllillll MINI I! IMIil. 

Db 313 CAAGCATGGGAAGTGAAAGATACTTATGAGGGAAGTAGGGATGTGCTGG CTG CTAGAG CT 254 

Qy 1432 AATGTGGAAAGTAAAGTGGACAGAAAATGCTTGGAAGATAG CCTGGAG CAAAAAAGT CTT 14 91 

IIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
Db 253 AATGTGGAAAGTAAAGTGGACAGAAAATGCTTGGAAGATAG CCTGGAG CAAAAAAGT CTT 194 

Qy 14 92 GGGAAGGATAGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAACCTGTG 1551 

MINI |M|||||! |!Mi| MMm !l ililll II III II IIIIM 

Db 193 GGGAAGGATAGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAACCTGTG 134 

Qy 1552 AAGGAC^GCTCCAGAGCATATATTACCTGTGCTTCCTTTACCTCAGC^ACCGAAAGCACC 1611 

IIIIIMMIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIII 

Db 133 AAGGACAGCTCCAGAGCATATATTACCTGTGCTTCCTTTACCTCAGCAACCGAAAGCACC 74 

Qy 1612 ACAGCAAACACTTTCCCTTTGTTAGAAGATCATACTTCA^ 1671 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIMIIIIMII 

Db 73 ACAGCAAA(^CTTTCCCTTTGTTAGAAGAT(^TACTT(^GAAAATAAAAaVGATGAAAAA 14 

Qy 1672 AAAATAGAAGAAA 1684 

I I I I I I I III 

Db 13 AAAAAAAAAAAAA 1 



RESULT 6 
BI730192 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



BI730192 805 bp mRNA linear EST 20-SEP-2001 

603349739F1 NIH_MGC_94 Mus musculus cDNA clone IMAGE : 5357385 5', 
mRNA sequence. 
BI730192 

BI730192.1 GI:15707205 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (bases 1 to 805) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: The Cepko Laboratory 
cDNA Library Preparation: Life Technologies, Inc. 



FEATURES 

source 



BASE COUNT 
ORIGIN 



cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: 
http : / /image . llnl .gov 
Plate: LLAM11908 row: n column: 10 
High quality sequence stop: 8 02. 

Locat ion/Qual i f iers 

1. .805 

/organism^ "Mus musculus" 
/mol__type=" mRNA " 
/db_xref = 11 1 axon: 10090" 
/clone=" IMAGE: 535738 5" 
/tissue_type=" retina " 
/lab_host="DH10B (phage-resistant ) " 
/ c 1 one_l ib= n NI H_MGC_9 4 " 

/note= "Organ: eye; Vector: pCMV-SPORT6; Site_l: Not I; 
Site_2 : Sail; Cloned unidirectional ly; oligo-dT primed. 
Average insert size 3.3 kb. Library enriched for 
full-length clones and constructed by Life Technologies. 
Note: this is a NIHJ4GC Library." 
266 a 179 c 165 g 195 t 



Query Match 19.0%; Score 709.8; DB 12; Length 805; 

Best Local Similarity 93.8%; Pred. No. 4.4e-102; 

Matches 751; Conservative 0; Mismatches 47; Indels 3; Gaps 



1; 



Qy 


1854 


AGC^TGTGAAAGTGAACTGAATGAAGCCACAGGTACAAAGATTGCTTATGAAACAAAAGT 

MIIIIIIIMMIIIIIIII MM llllll M IIMM M IIMM MM 

AGCATGTGAAAGTGAACTGAACGAAGCCACAGGTACAAAGATTC 


1913 


Db 


1 


60 


Qy 


1914 


GGACTTGGTCCAAACATCAGAAGCTATACAAGAATCA 

llllllllllll III. II INI III IIMM M llllll MM 

GGACTTGGTCCAGACTVTCTVGAAGCTATACAAGAGTC^ 


1973 


Db 


61 


120 


Qy 


1974 


CCCATCATTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATGGA 

Ml llllll M IIMM MUM llllll II MMM MMIMM Mill 

CCCATCATTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATGGA 


2033 


Db 


121 


180 


Qy 


2034 


AGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTATC 

III MM 11,11 IIMIIIII IIMM 1 1 1 1 1 1 1 1 1 1 ' 1 1 1 1 1:11111111 III 

AGCGCCATTAAATTCTCTCCTTCCAAGCACTGGTGCTTCTGTAGCGCAGCCCAGTGCATC 


2093 


Db 


181 


240 


Qy 


2094 


CCCACTGGAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCTTGAGCCTGAAAACCC 

llllll MM III IIIIIIIIIIIMIIII 1 1 1 1 1 1 1 1 M 1 1 1 1 1 ! 1 1 1 1 1 i II 

CCCACTAGAAGTACCGTCTCCAGTTAGTTATGACGGTATAAAGCTTGAGCCTGAAAATCC 


2153 


Db 


241 


300 


Qy 


2154 


CCCACCATATGAAGAAGCCATGAATGTAGCACTAAAAGCTTTGGG^ 

1 IMIIIMIMM MM 1 llllll M 1 1 II IIMIIIII III 

CCCACCATATGAAGAAGC(^TGAGTGTAGCACTAAAAACATCGGACT(JAAAGGAAGAAAT 


2213 


Db 


301 


360 


Qy 


2214 


AAAAGAGCCTGAAAGTTTTAATGCAGCTGTTCAGGAAACAGAAGCTCCTTATATATCCAT 

II 1 1 M 1 MM 1 : iiilin I | i | i| | || 

TAAAGAGCCTGAAAGTTTTAATGCAGCTGCTCAGGAAGCAGAAGCTCCTTATATATCCAT 


2273 


Db 


361 


420 



Qy 



2274 TGCGTGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGCCAAGTCCAGATTTCTCTAA 2333 



Ml MIMIIIIIIIMIMIMIIIMIIIIIIMIIIIMIIIIIIII HIM 

Db 421 TGCATGTGATTTAATTAAAGAAA 48 0 

Qy 2334 TTATTCAGAAATAGCAAAATTCGAGAAGTCGGTGCCCGAACACGCTGAGCTAGTGGAGGA 23 93 

M 1 1 M i 1 11 1 1 1 M 1 1 1 1 1 1 MIIMIIIIIIM II 111 llllll Mill II 

Db 481 TTATTCAGAAATAGCAAAATTTGAGAAGTCGGTGCCTGATCACTGTGAGCTCGTGGATGA 54 0 

Qy 2394 TTCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAAGTCCC 24 53 

MINI MMMMMMMMMMMMMMMIMM 1 1 1 1 II 1 1 1 1 1 1 1 i 

Db 541 TTCCTCACCCGAATCTGAACCAGTTGACTTATTTAGTGATGATTCAATTCCTGAAGTCCC 600 

Qy 2454 ACAAACACAAGAGGAGGCTGTGATGCTCATGAAGGAGAGTCTCACTGAAGTGTCTGAGAC 2513 

MMMMMMMMMMMIMM ! 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 J 1 1 ! I 

Db 601 ACAAACACAAGAGGAGGCTGTGATGCTAATGAAGGAGAGTCTCACTGAAGTGTCTGAGAC 660 
Qy 2514 AGTAGCCCAGCACAAA GAGGAGAGACTTAGTGCCTCACCTCAGGAGCTAGGAAAGCC 2570 

MM I II llllll III II MUM IMMMIMM II MM I ! 

Db 661 AGTAACACAACACAAACATAAGGAGAGACTTAGTGCTTCACCTCAGGAGGTAGGAAAGCC 72 0 

Qy 2571 ATATTTAGAGTCTTTT CAG CC CAATTTACATAGTACAAAAGATGCTG CAT CTAATGACAT 2630 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 J 1 1 1 1 1 1 f 1 1 i Ml IMMMIMMIMI III M 

Db 721 ATATTTAGAGTCTTTTCAGCCCAATTTACATATTACCAAAGATGCTGCATCTACTGAAAT 78 0 

Qy 2 631 TCCAACATTGACCAAAAAGGA 2651 

IIIIIIIIIIIIIIIMMII 
Db 781 TCCAACATTGACCAAAAAGGA 8 01 



RESULT 7 
CB521332 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



EST 28-MAR-2003 



822 bp mRNA linear 
-13-0-UI.rl NIH_BMAPJ3H0 Mus musculus cDNA clone 
5*, mRNA sequence. 



Craniata ; Vertebrata ; Euteleostomi ; 
Sciurognathi ; Muridae; Murinae; Mus. 



FEATURES 

source 



CB521332 
UI-M-GHO-cem-h- 
IMAGE: 6841502 
CB521332 

CB521332 .1 GI : 2 9354 687 
EST . 

Mus musculus {house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Rodent ia; 
1 (bases 1 to 822) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Dr. Jim Lin, University of Iowa 
cDNA Library preparation: Dr. M. Bento Soares, University of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa 
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa 
Clone Distribution: Distribution information can be found at 

http : //genome . uiowa . edu/distribution/mousef 1 . html 
This clone was contributed by the Brain Molecular Anatomy Project 

(BMAP) 

Seq primer: pYX-5 . 

Location/Qualifiers 
1. .822 



/organism="Mus musculus" 
/ mo 1 _ t yp e = " mRNA " 
/strain="C57BL/6" 
/db_xref="taxon: 10090" 
/clone=" IMAGE: 6841502" 
/tissue_type="Whole brain" 
/dev__stage=" 1 , 5, and 15 days newborn" 
/lab_host="DH10B (Tl phage resistant)" 
/ c 1 one_l ib= " N I H_BMAP_GH0 " 

/note="Organ: Brain; Vector: pYX- Asc; Site_l: EcoR I; 
Site_2: Not I; The library was constructed according 
Bonaldo, Lennon and Soares, Genome Research, 6:791-8 06, 
1996. Denatured RNA was size fractionated on a 1% agarose 
gel. First strand cDNA synthesis was primed with oligo-dT 
primer containing a Not I site. Double strand cDNA was 
size selected according to mRNA size fraction, ligated 
with EcoR I adaptor, digested with NotI and then cloned 
direct ionally into pYX-Asc vector. The library tag 
sequence located between the Not I site and the polyA tail 
is CGAACTGAAT. This library was created for the University 
Iowa Brain Anatomy Project (BMAP) : 'Gene Discovery in the 
Developing Mouse Nervous System', supported by National 
Institute of Mental Health (NIMH), Hemin Chin, Ph.D., 
program coordinator." 

BASE COUNT 233 a 162 c 181 g 246 t 

ORIGIN 

Query Match 19.0%; Score 709.4; DB 14; Length 822; 

Best Local Similarity 92.6%; Pred. No. 5.1e-102; 

Matches 771; Conservative 0; Mismatches 51; Indels 11; Gaps 2; 



Qy 2762 CTCCGATTGAGATAATAGATGAATTTCCCACGTTTGTCAGTGCTAAAGATGATTCTCCTA 2821 

1 1 1 1 1 1 1 ! 1 1 ; i 1 1 1 j 1 1 1 1 1 iniiiii 1 1 j 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 j 1 1 1 1 1 1 1 1 

Db 1 CTCCCATTGAGATAATAGATGAGTTTCCCACATTTGTCAGTGCTAAAGATGATTCTCCT- 59 

Qy 2 822 AATTAGCCAAGGAGTACACTGATCTAGAAGTATCCGACAAAAGTGAAATTGCTAATATCC 28 81 

llllllllllllll IIIIIMIIMI llllllllllllllllllll III 

Db 60 AAGGAGTACACTGACCTAGAAGTATCCAACAAAAGTGAAATTGCTAATGTCC 111 

Qy 2882 AAAGCGGGGCAGATTCATTGCCTTGCTTAGAATTGCCCTGTGACCTTTCTTTCAAGAATA 2 941 

I INIIIII 1 1 II MINIMI MINI IMIMMIIMI IMMMIMM 

Db 112 AGAGCGGGGCCAATTCGTTGCCTTGCTCAGAATTGCCCTGTGACCTTTCTTTCAAGAATA 171 

Qy 2 942 TATATCCTAAAGATGAAGTACATGTTTCAGATGAATTCTCCGAAAATAGGTCCAGTGTAT 3001 

1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 MINI lllllllllllllll Ml llllllllllllll 

Db 172 CATAT C CTAAAGATGAAG CACATGTCT CAGATGAATT CT CCAAAAGTAGGTC CAGTGTAT 231 

Qy 3 002 CTAAGGCATCCATATCGCCTT(^WVTGTCTCTGCTTTGGAACCTC^GACAGAAATGGGav 3061 

MINI II III II I lllllll . i 11.11 I I M I I I M I, 

Db 232 CTAAGGTGCCCTTATTGCTTCCAAATGTTTCTGCTTTGGAATCTCAAATAGAAATGGGCA 2 91 

Qy 3 062 GCATAGTTAAATCCAAATCACTTACGAAAGAAGCAGAGAAAAAACTTCCTTCTC 3121 

MIMMIII Mill III 1 1 1 II 1 1 1 MM II I 1 1 II I II IMM 1 1 1 1 1 1 

Db 2 92 AC^TAGTTAAACCCAAAGTACTTACGAAAGAAGCAGAGGAAAAACTTCCTTCTGATACAG 351 

Qy 3122 AGAAAGAGGACAGATCC CTGTCAGCTGTATTGTCAG CAGAG CTGAGTAAAACTTCAGTTG 3181 

llllllllllllllllllll lllllllll llllllillllllll llllllllllllll 



Db 



352 AGAAAGAGGACAGATCCCTGACAGCTGTATTGTCAGCAGAGCTGAATAAAACTTCAGTTG 411 



Qy 3182 TTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTAT 3241 

Illlllllll ; I ! I ! ; I 1 1 1 1 1 1 i I 1 1 1 1 II 1 1 1 1 1 1 ! 1 1 1 1 i I II III! 1 1 

Db 412 TTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTAT 471 

Qy 3242 TCCTGCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACATTGCCTTGG 3301 

IIIIIMIMMIMIIIIIIIIII ,111111 llllll, II llllll Mill 

Db 472 TCCTGCTGCTGTCTCTGACAGTGTT - - TCATTGTCAGTGTAACGGCCTACATTGCCTTGG 52 9 

Qy 3302 CCCTGCTCTCGGTGACTATCAGCTTTAGGATATATAAGGGCGTGATCCAGGCTATCCAGA 3361 

Illlllllll I 1 1 M I I I I I : I i I : I I I : I I I I I I Ml 111 Illlllllll 
Db 53 0 CCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGGTGTGATCCAAGCTATCCAGA 589 

Qy 3362 AATCAGATGAAGGCCACCCATTCAGGGCATATTTAGAATCTGAAGTTGCTATATCAGAGG 3421 

IIIMMUiNII II llillUIN lllllllllllll! Illlllllll 

Db 590 AATCAGATGAAGGCCACCCATTCAGGGCATATTTGGAATCTGAAGTTGCCATATCAGAGG 64 9 

Qy 3422 AATTGGTTCAGAAATACAGTAATTCTGCTCTTGGTCATGTGAACAGCACAATAAAAGAAC 34 81 

IIIIIIIIIIMIMI IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMI 

Db 650 AATTGGTTCAGAAATATAGTAATTCTGCTCTTGGTCATGTGAACAGCACAATAAAAGAAT 709 

Qy 3482 TGAGGCGGCTTTTCTTAGTTGATGATTTAGTTGATTCCCTGAAGTTTGCAGTGTTGATGT 3541 

MM II MMMI II II: II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 710 TGAGGCGTCTCTTCTTAGTTGATGACTTAGTTGATTCCCTGAAGTTTGCAGTGTTGATGT 769 

Qy 3542 GGGTGTTTACTTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTGATTTTA 35 94 

1 1 1 1 MINIM MMMI MUM Mill MMMMMM MM 

Db 77 0 GGGTATTTACTTACGTTGGTGCCTTGTTCAATGGTTTGACACTACTGACTTTA 822 



RESULT 8 
BU841009 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



BU841009 986 bp mRNA linear EST 16-OCT-2002 

AGENCOURT_l 0187690 NIH_MGC_134 Mus musculus cDNA clone 
IMAGE: 6518816 5', mRNA sequence. 
BU841009 

BU841009 . 1 GI : 24 0254 09 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (bases 1 to 986) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 
Email : cgapbs-r@mail .nih.gov 
Tissue Procurement: Dr. David Rowe 
cDNA Library Preparation: Invitrogen Corp 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Agencourt Bioscience Corporation 

Clone distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consort ium/LLNL at: 

http : / / image . 1 lnl . gov 

Plate: LLAM14101 row: c column: 09 



FEATURES 

source 



BASE COUNT 
ORIGIN 



High quality sequence start: 21 
High quality sequence stop: 644. 

Location/Qualif iers 

1. .986 

/organ ism="Mus musculus" 
/mol_type= M mRNA n 
/db_xref="taxon: 10090" 
/clones" IMAGE: 6518816" 
/tissue_type= "undifferentiated limb" 
/lab_host="DH10B (phage-resistant) " 
/ c 1 one_l i b= " N I H_MGC_1 3 4 " 

/note="Vector: pCMV-SPORT6 . 1 . ccdb; Site_l: EcoRV; Site_2 : 
Not I; Cloned unidirectionally . Primer: Oligo dT. Average 
insert size 1.7 kb. Constructed by ResGen, Invitrogen 
Corp. Note: this is a NIH_MGC Library." 
302 a 227 c 208 g 248 t 1 others 



Query Match 18.9%; Score 707.8; DB 13; Length 986; 

Best Local Similarity 87.8%; Pred. No. 8.9e-102; 

Matches 832; Conservative 0; Mismatches 103; Indels 13; Gaps 



5; 



Qy 1728 AAATCCTTTCCTTGTAGCAGTACAGGATTCTGAGGCAGATTATGTTACAACAGATACCTT 1787 

MIMIIIIIIIIIMIII MM MMMMMMMMMM MMMMM 1 1 

Db 2 9 AAATCCTTTCCTTGTAGCAATACATGATTCTGAGGCAGATTATGTCACAACAGATAATTT 88 



178 8 ATCAAAGGTGACTGAGGCAGCAGTGTCAAACAT^ 1847 



MIMMMMMMMI 



M Ml MMMMMMMI MMMMI 



Qy 

Db 8 9 ATCAAAGGTGACTGAGGC^GTAGTGG(^C(^TGCCTGAAGGTCTAACGCC^GATTTAG^ 14 8 

Qy 184 8 TCAGGAAG CATGTGAAAGTGAACTGAATGAAGC CACAGGTACAAAGATTG CTTATGAAAC 1907 

1 1 II I MM 1 1 1 1 1 1 1 1 1 ; I ; I MMMMIMMMMMMMMMMMI 

Db 14 9 TCAGGAAG CATGTGAAAGTGAACTGAACGAAGC CACAGGTACAAAGATTG CTTATGAAAC 2 08 

Qy 1908 AAAAGTGGACTTGGTCCAAACATCAGAAGCTATACA^ 1967 

MMMMMMMMM MMMMMMMMIMI IM MMMMMMMM 

Db 209 AAAAGTGGACTTGGTCCAGACATCAGAAGCTATACA 268 

Qy 1968 GCTTTGCCCATCATTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGT 2 027 

MMMMM MMMMMMMMMM MM MM MM MMMMM MM MM 

Db 269 GCTTTGCCCATCATTTGAGGAAGCTGAAGC^ACTCCGTCACCAGTTTTGCCTGATATTGT 328 

Qy 2 028 TATGGAAGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAG 2087 

lllllllll MIMIIIIIIIIIIIIMIMII MINI III III II lllllllll 

Db 32 9 TATGGAAGCGCCATTAAATTCTCTCCTTCCAAGCACTGGTGCTTCTGTAGCGCAGCCCAG 388 

Qy 2 088 TGTATCCCCACTGGAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCTTGAGCCTGA 2147 

II lllllllll 1 1 1 1 III I llllll IIIIIIIIM II IIMI llllll IIMI 

Db 38 9 TGCATCCCCACTAGAAGTACCGTCTCCAGTTAGTTATGACGGTATAAAGCTTGAGCCTGA 448 

Qy 2148 AAACCCCCCACCATATGAAGAAGCCATGAATGTAGCACTAAAAGCTTTGGGAACAAAGGA 22 07 

III ill I II I I I III I I I II I I I I II I I lllllllllllll I ! II llllll 

Db 44 9 AAATCCCCCACCATATGAAGAAGCCATGAGTGTAGCACTAAAAACATCGGACGay^AGGA 5 08 

Qy 22 08 AGGAATAAAAGAG CCTGAAAGTTTTAATG CAGCTGTTCAGGAAACAGAAG CTCCTTATAT 22 67 

M Ml I II MM MMIM II MM 1 1 1 II 1 1 M llllll Mill 

Db 509 AGAAATTAAAGAGCCTGANAGTTTTAATG CAG CTGCTCAGGAAG CAGAAGCTCCTTATAT 568 



Qy 
Db 

Qy 
Db 

Qy 

Db 
Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



2268 ATCCATTGCGTGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGCCAAGTCCAGATTT 2327 

MINIMI IMIIMMIMIMIMIMIMMIMIMMMMMMIMM II 

56 9 ATCCATTG CATGTGATTTAATTAAAGAAACAAAG CTCTC CACTGAG CCAAGTC CAGAGTT 628 
2328 CTCTAATTATTCAGAAATAGCAAAATTCGAGAAGTCGGTGCCCGAACACGCTGAGCTAGT 2387 

MIMIIIII MIIIIMI MM MIIMIMIMM II III I MM II 

62 9 CTCTAATTATTCAGAAATAG CAAAATTTGAGAAGTCGGTG CCTGAT CACTGTGAG CTCGT 688 
23 8 8 GGAGGATTCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGA 244 7 

III I II I Ml: Mill I 1 1 1 1 1 1 M M 1 1 1 1 1 i 11 1 1 1 II Mill 

689 GGATGATTCCTCACCCGAATCTGAACCAGTTGACTTATTTAGTGATGATTCAATCCCTGA 748 
244 8 AGTCCCACAAACACAAGAGGAGGCTGTGATGCTCATGAAGGAGAGTCTCACTGAAGTGTC 2507 

III M M M II M 1 1 MM II Mill MM Ml I MM 

74 9 AGTCCCACCAACACAAGAGGAAGCTGTGATGCTTATGAAAGAGAGTCTCACCTGAATGTC 8 08 



2 5 08 TGAGACAGTAGCCCAGC - 



- ACAAAGAGGAGAGACTTAGTG - 



- -CCTCACCTCAGGAG 2559 



Ml Ml Ml I Ml I MMMMMM II ! II Ml 

8 09 TGAGACAGTTACCCCACCCCAACATAAGGGAGAGACTTAGTGCTTTCCCCTCCGGAGGGT 8 68 
2560 CTAGGAAAG C CATATTTAGAGTCTTTT - - CAGCCCAATTTACATAGT - - ACAAAAGATGC 2615 

I I II I i 1 1 1 1 1 1 1 1 1 i I Mllllllll 1 1 I MM MM 

8 69 AGAAAAGGCCCTATTTTAGAGTCTTTTTCAGCCCCAATTTACCTATTTACCAAAGGATGC 928 
2616 TGCAT - CTAATGACATTCCAACATTGACCAAAAAGGAGAAAATTTCTT 2662 

III I HUM I M II II I Ml Ml I III 

92 9 TGCCTCCTAATGAAAATTCCACCTTTGGCCCAAAAGGGAGACCATTTT 976 



RESULT 9 

BU590898/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



BU590898 919 bp mRNA linear EST 20-SEP-2002 

AGENCOURT_8798355 NIH_MGC_137 Mus musculus cDNA clone IMAGE : 643 0691 
5', mRNA sequence. 
BU590898 

BU590898.1 GI:23242434 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Rodent ia; 
1 (bases 1 to 919) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection 
Unpublished 

Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Gerard Gradwohl (PNAS 97 P1607-1611, 

cDNA Library Preparation: Catherine Lee, Endocrine Pancreas 
Consortium 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consort ium/LLNL at: 

http: // image. llnl .gov 

Plate: IRBD8 row: e column: 12 



Craniata; Vertebrata ; Euteleostomi ; 
Sciurognathi ; Muridae; Murinae; Mus. 



(MGC) 



2000) 



High quality sequence start: 23 
High quality sequence stop: 123. 
FEATURES Locat ion/Qual if iers 

source 1. .919 

/organism^ "Mus musculus" 

/mol_type="mRNA" 

/db_xref="taxon: 10090" 

/clone=" IMAGE: 6430691" 

/lab_host=" DH10B" 

/clone_lib="NIHJV[GC_137 n 

/note="Organ: pancreas; Vector: pSPORTl; Site_l: Sail; 
Site_2: Not I; Library consists of a pool of clones 
rearrayed from the following libraries: Melton normalized 
mixed mouse pancreas 1 N1-MMS1, Amplified Melton mouse 
islets 1 MIS1-A, and Kaestner ngn3 wt. Clones rearrayed in 
the laboratory of K. Kaestner (University of Pennsylvania 
). Note: this is a NIH_MGC Library." 

BASE COUNT 235 a 208 c 162 g 314 t 

ORIGIN 

Query Match 18.3%; Score 68 4.6; DB 13; Length 919; 

Best Local Similarity 88.5%; Pred. No. 4.1e-98; 

Matches 794; Conservative 0; Mismatches 84; Indels 19; Gaps 4; 
Qy 805 CCTGTGATACCCTCCTCTG- - CAGAAAAAATTATGGATTTGATGGAGCA-GCCAGGTAAC 8 61 

MM MINI I II 1 1 1 1 lllllllllllll MINI MINIMI 

Db 8 97 CCTGTGATACCCTCCCTTGACAGAAAAATATTATGGATTTGAAGGAGCACCCCAGGTAAC 838 

Qy 8 62 ACTGTTTCGTCTGGTC-AAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCCTCTCT 92 0 

MIIIIMI III Ml IMMMMMMM II MIMIII Ml MM 

Db 837 CATGTTTCGTCATGTCTAAGAGGATTTCCCATCTGTCGTGTATGAAACTGTTGCCTCTCT 778 

Qy 921 TCCTTCTCTATCTCCTCTCTCAACTGTTTCTTTTAAAGAACATGGATACCTTGGTAACTT 98 0 

MM I III I II II i 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 [ 1 1 1 [ MM I III! Ml 

Db 777 TCCTGGTATATATACTATCCCAACTGTTTCTTTTAAAGAACACGGATTCTTTGGTAACTT 718 

Qy 981 ATCAG CAGTGT CAT CCT CAGAAGGAACAATTGAAGAAACTTTAAATGAAG CTTCTAAAGA 104 0 

MIIIIMI f 1 1 1 1 i J I J 1 1 f 1 1 1 i lllllll II II III II I IMIIJ II 1 1 III 

Db 717 ATCAGCAGTGGCATCCTCAGAAGGAACTATTGAAGTAACTTTAAATGAAGCTTCTAGAGA 658 

Qy 1041 GTTGCCAGAGAGGGCAACAAATCCATTTGTAAATAGAGATTTAGCAGAATTTTCAGAATT 1100 

II MMMMMIMM IMIIMMIMI Mill I llllll Mill III 

Db 657 ATTCCCAGAGAGGGC^CAAATCC^TTTGTAAATAGAGAGTCAGCAGAGTTTTCAGTATT 598 

Qy 1101 AGAATATTCAGAAATGGGATCATCTTTTAAAGGCTCCCCAAAAGGAGAGTCAGCCATATT 1160 

MINI llllllllllllllllllll || 1 1 II M I I: I M ! || 

Db 597 AGAATAGTCAGAAATGGGATCATCTTTCAATGGGTCCCCAAAAGGAGAGTCAGCCATGTT 538 

Qy 1161 AGTAGAAAACACTAAGGAAGAAGTAATTGTGAGGAGTAAAGACAAAGAGGATTTAGTTTG 122 0 

1 1 1 i 1 1 i I ! 1 1 lllllll lllllll MIMI Illlllll lllllll IIMIIIMM 

Db 53 7 AGTAGAAAACATTAAGGAAGAAGTAATTGTGAGGAGTAAAGGCAAAGAGGATTTAGTTTG 478 
Qy 1221 TAGTGCAGCCCTTCACAGTCCACAAGAATCACCT GTGGGTAAAGA 1265 

lllllllllllllll I MIIIIMI llllll 1 1 1 1 llllll 

Db 477 TAGTGCAGCCCTTCATAATCCACAAGAGTCACCTGCGTCCCTTACTAAAGTGGTTAAAGA 418 



Qy 



1266 AGACAGAGTTGTGTCTCCAGAAAAGAf^TGGAQ^TTTTTAATGAAATGCAGATGTCAGT 1325 



II I Mill I 1 1 1 i i 1 1 i I . I ! . I i I ^ ! Mill 1 1 1 1 1 1 1 1 1 1 I MINIM 

Db 417 AGTCGGAGTTATGTCTCCAGAAAAGACAATGGACATTTTTAATGAAATGAAAATGTCAGT 358 

Qy 1326 AGTAG CAC CTGTGAGGGAAGAGTATGCAGACTTTAAG CCATTTGAACAAG CATGGGAAGT 13 8 5 

lllllllll IMIIIMIIIIIIIIIII i 1 1 1 1 1 1 1 1 1 1 M II I III I ! 1 1 M 

Db 357 GGTAGCACCTCTGAGGGAAGAGTATGCAGATTTTAAG CCATTTGAACAAG CATGGGAAGT 2 98 

Qy 1386 GAAAGATACTTATGAGGGAAGTAGGGATGTGCTGGCTGCTAGAGCTAATGTGGAAAGTAA 144 5 

IIIIIIIIIIIIIIIIIMIMIIMIMI MM MMMM MM II IIMI 

Db 2 97 GAAAGATACTTATGAGGGAAGTAGGGATGTTCTGGTTGCTAGAGATAATATGGAAAGTAA 23 8 

Qy 1446 AGTGGACAGAAAATGCTTGGAAGATAGCCTGGAGCAAAAAAGTCTTGGGAAGGATAGTGA 15 05 

Mill II lllllllll ; 1 1 : 1 1 : 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 MIMIIIIIMIII 

Db 237 AGTGGG CAAAAAATG CTTTGAAGATAGCCTGGAG CAAAAAAGT CATGGGAAGGATAGTGA 17 8 

Qy 1506 AGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAACCTGTGAAGGACAGCTCCAG 1565 

I IIMIII: Ml 1 1 i 1 1 1 1 1 i I M 1 1 1 1 1 1 1 i 1 1 1 MMMM IIMIII 

Db 177 AAGCAGAAATGAGAATGGTTCTTTCCCCAGTACCCCAGAACTTGTGAAGGACGGCTCCAG 118 

Qy 1566 AGCATATATTACCTGTGCTTCCTTTACCTCA^ 1625 

Ml II II MUM III MMIMiMM I II M lllllllll Ml 

Db 117 AGCGTACATO^CCTGTGATTCCTTTACCTCAGCAACCGAGAGTACTGCAGCAAACATTTT 58 

Qy 1626 CCCTTTGTTAGAAGAT(^TACTT(^GAAAATAAAACAGATGAAAAAAAAATAGAAGA 1682 

MM II MMMM II MMMM 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 I II I 

Db 57 CCCTGTGCTAGAAGATCACACTTCAGAAAATAACACAGATGAAAAAAAAAAAAAAAA 1 



RESULT 10 

BU709106 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 



BU709106 778 bp mRNA linear EST 26-NOV-2002 

UI-M-EW0-caz-g-18-0-UI .rl NIH_BMAP_EW0 Mus musculus cDNA clone 
IMAGE: 6419369 5 f , mRNA sequence. 
BU709106 

BU709 106.1 GI : 2364224 7 
EST . 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus . 
1 (bases 1 to 778) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Dr. James Lin, Univeristy of Iowa 
cDNA Library preparation: Dr. M. Bento Soares, University of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa 
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa 
Clone Distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consort ium/LLNL at: 

http : // image . llnl . gov 
This clone was contributed by the Brain Molecular Anatomy Project 

(BMAP) 

Seq primer: pYX-5. 

Location/Qualifiers 



source 1. .778 

/organism="Mus musculus" 
/mol_type= n mRNA" 
/strain="C57BL/6" 
/db_xref="taxon: 10090" 
/clone=" IMAGE: 6419369" 
/tissue_type=" whole brain" 
/dev_stage=" embryo 15.5 dpc" 
/lab_host= M DH10B (Tl phage resistant) " 
/clone_l ib- "NI H_BMAP_EW0 " 

/note="0rgan: brain; Vector: pYX-Asc; Site_l: EcoR I; 
Site_2 : Not I; The library was constructed according to 
Bonaldo, Lennon and Soares, Genome Research, 6:791-806, 
1996. Denatured mRNa was size fractionated on a 1% agarose 
gel. First strand cDNA synthesis was primed with an 
oligo-dT primer containing a Not I site. Double stranded 
cDNA was size selected according to mRNA size fraction, 
ligated with EcoR I adaptor, digested with Not I, and then 
cloned directionally into pYX-Asc vector. The library tag 
sequence located between the Not I site and the polyA tail 
, is GTGCGTGGAA. This library was created for the 
University of Iowa Mouse Brain Molecular Anatomy Project 
(BMAP) : 'Gene Discovery in the Developing Mouse Nervous 
System", supported by National Instututes of Mental Health 
(NIMH), Hemin Chin, Ph.D., program coordinator." 

BASE COUNT 250 a 169 c 167 g 190 t 2 others 

ORIGIN 



Query Match 18.0%; Score 673.6; DB 13; Length 778; 

Best Local Similarity 93.9%;' Pred. No. 2.3e-96; 

Matches 711; Conservative , 0; Mismatches 45; Indels 1; Gaps 1; 



Qy 


1753 


GATTCTGAGGCAGATTATGTTACAACAGATACCTTATCAAAGGTGACTGAGGCAGCAGTG 

llllllllllllllllllll Mllllllll 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 MM 

GATTCTGAGG(^GATTATGTCAC^CAGATAATTTAT(^AGGTGACTGAGGCAGTAGTG 


1812 


Db 


2 


61 


Qy 


1813 


TCAAAC^TGCCTGAAGGTCTGACGCCAGATTTAGTTCAGGAAGaVTGTGAAAGTGAACTG 

Ml MMMMIMMM M M M M M M M M M M M M 1 M 1 M M M M M 1 

GCAACCATGCCTGAAGGTCTAACGCCAGATTTAGTTCAGGAAGCATGTGAAAGTGAACTG 


1872 


Db 


62 


121 


Qy 


1873 


AATGAAGCCACAGGTACAAAGATTGCTTATGAAACAAAAGTG 

M MMIMMMMMMMMMMMMMMMMMMMMMMI MMM 

AACGAAGCCACAGGTACAAAGATTGCTTATGAAACAAAAG 


1932 


Db 


122 


181 


Qy 


1933 


GAAGCTATACAAGAATCACTTTACCCCACAGCA 

MIMMMMMI Ml 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 II 1 M 1 1 

GAAGCTATAC7W3AGTCAATTTACCCCACAGCA 


1992 


Db 


182 


241 


Qy 


1993 


GAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATGGAAGCACCATTAAATTCTCTC 


2052 


Db 


242 


1 1 II 1 M 1 1 1 1 1 II 1 II 1 1 1 1 1 1 M 1 1 II 1 M 1 1 II 1 M II 1 1 1 IMIIIIIM 

GAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATGGAAGCGCCATTAAATTCTCTC 


301 


Qy 


2053 


CTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTATCCCCACTGGAAGCACCTCCT 

II MM MIMIMMMIII Ml Mill! IMIIMM MM Ml II 

CTTCCAAGC^CTGGTGCTTCTGTAGCGO^GCCCAGTGCATCCCCACTAGAAGTACCGTCT 


2112 


Db 


302 


361 



Qy 2113 CCAGTTAGTTATGACAGTATAAAG CTTGAG C CTGAAAACCCCCCACCATATGAAGAAG CC 2172 



Mi MINIMI 1 1 i 1 1 i 1 1 1 1 1 1 I M I M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 

Db 362 CCAGTTAGTTATGACGGTATAAAGCTTGAGCCTGAAAATCCCCCACCATATGAAGAAGCC 421 

Qy 2173 ATGAATGTAG CACTAAAAG CTTTGGGAACAAAGGAAGGAATAAAAGAGCCTGAAAGTTTT 2232 

1 1 1 1 Mill IMM I I II MINIM III III NININNIM 

Db 422 ATGAGTGTAGCACTAAAAACATCGGACTCAAAGGAAGAAATTAAAGAGCCTGAAAGTTTT 481 

Qy 2233 AATG CAG CTGTT CAGGAAACAGAAG CTC CTTATATATCCATTG CGTGTGATTTAATTAAA 22 92 

III INN INN I i I i 1 1 1 1 : 1 1 1 1 1 1 1 1 Mill Illlllll MM 

Db 482 AATG CAG CTG CT CAGGAAG CAGAAG CTC CTTATATATCCATTG CATGTGATTTAATTAAA 541 

Qy 22 93 GAAACAAAGCT CTC CACTGAGC CAAGT CCAGATTTCTCTAATTATT CAGAAATAGCAAAA 2352 

MMI 1 1 1 IN II 1 1 1 N I N I II 1 1 I lllllllllllllllllllllllllll 

Db 54 2 GAAACAAAGCTCTCCACTGAGCCAAGTCCAGAGTTCTCTAATTATTCAGAAATAGCAAAA 601 

Qy 2353 TTCGAGAAGTCGGTGCCCGAACACGCTGAGCTAGTGGAGGATTCCTCACCTGAATCTGAA 2412 

ii i i : 1 1 : 1 1 1 1 : i n in in mm minim iiiiiiiii 

Db 602 TTTGAGAAGTCGGTGCCTGATCACTGTGAGCTCGTGGATGATTCCTCACCCGAATCTGAA 661 

Qy 2413 CCAGTTGACTTATTTAGTGATGATTCGATTCCTGAAGTCCCACAAACACAAGAGGAGGCT 2472 

MM M M 1 1 M I II I ! 1 1 i I I lllllllllll Mill INIINN III 

Db 662 CCAGTTGACTTATTTAGTGATGATTCAATTCCTGAAGT - CCACANACACAAGAGGAGGCT 72 0 

Qy 24 73 GTGATGCTCATGAAGGAGAGTCTCACTGAAGTGTCTG 2 509 

Illlllll 1 IMM MMI 

Db 721 GTGATGCTAATGAAGGAGAGTCTCACTGAAGTGTCTG 757 



RESULT 11 

CA320618 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 



CA320618 777 bp mRNA linear EST 26-NOV-2002 

UI-M-FW0-ccb-k-24-0-UI . rl NIH_BMAP_FW0 Mus musculus cDNA clone 
IMAGE: 6817393 5', mRNA sequence. 
CA320618 

CA320618 . 1 GI : 24 538742 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (bases 1 to 777) 
NIH-MGC http : //mgc . nci . nih . gov/ . 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Dr. Jim Lin, University of Iowa 
cDNA Library preparation: Dr. M. Bento Soares, University of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Soares , University of Iowa 
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa 
Clone Distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consort ium/LLNL at: 

ht tp : / / image . 1 lnl . gov 
This clone was contributed by the Brain Molecular Anatomy Project 

(BMAP) 

Seq primer: pYX-5. 

Location/Qualifiers 



source 



BASE COUNT 
ORIGIN 



1. .777 

/organism="Mus mus cuius" 
/ mo 1 _ t yp e = " mRNA " 
/strain= M C57BL/6" 
/db_xref="taxon: 10090" 
/clone=" IMAGE: 6817393" 
/t is sue_type=" whole brain" 

/dev_stage= "embryo 13.5,14.5,16.5,17. 5dpc" 
/lab_host="DH10B (Tl phage resistant)" 
/ c 1 one_l ib= " N I H_BMAP_FW0 " 

/note="Organ: Brain; Vector: pYX- Asc; Site_l : EcoR I; 
Site_2: Not I; The library was constructed according 
Bonaldo, Lennon and Soares, Genome Research, 6:791-806, 
1996. Denatured RNA was size fractionated on a 1% agarose 
gel. First strand cDNA synthesis was primed with oligo-dT 
primer containing a Not I site. Double strand cDNA was 
size selected according to mRNA size fraction, ligated 
with EcoR I adaptor, digested with NotI and then cloned 
directionally into pYX-Asc vector. The library tag 
sequence located between the Not I site and the polyA tail 
is AGCGAGACAG. This library was created for the University 
Iowa Brain Anatomy Project (BMAP) : 'Gene Discovery in the 
Developing Mouse Nervous System' , supported by National 
Institute of Mental Health (NIMH), Hemin Chin, Ph.D., 
program coordinator." 
267 a 153 c 151 g 205 t 1 others 



Query Match 17.7%; 
Best Local Similarity 92.1%; 
Matches 724; Conservative 



Score 662.6; DB 14; Length 777; 
Pred. No. 1.2e-94; 
0; -Mismatches 50; Indels 12; 



Gaps 



2; 



Qy 214 7 AAAAC CCCCCAC CATATGAAGAAG C CATGAATGTAGCACTAAAAG CTTTGGGAACAAAGG 2206 

1 1 1 1 IIIIIIIIIIIIIIIIIMIIIIII II MINIM I I II III 

Db 1 AAAATCCCCCACCATATGAAGAAGCCATGAGTGTAGCACTA 60 



Qy 22 07 AAGGAATAAAAGAGCCTGAAAGTTTTAATGCAGCTGTTCAGGAAACAGAAGCTCCTTATA 2266 

ill ill i iiiiii mum 11 iiiiii Miiiii iiiii Mini 

Db 61 AAGAAATTAAAGAG C CTGAAAGTTTTAATG CAGCTGCT CAGGAAG CAGAAG CTC CTTATA 12 0 

Qy 2267 TATCCATTGCGTGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGCCAAGTCCAGATT 232 6 

MINIUM IMM MIMI II IIIIII MIMMMMIMM MIMi I 

Db 121 TATCCATTGC^TGTGATTTAATTAAAGAAACAAAGCTCTCCA.CTGAGCCAAGTCCAGAGT 18 0 

Qy 2327 TCTCTAATTATTCAGAAATAG CAAAATTCGAGAAGTCGGTG CCCGAACACG CTGAG CTAG 2386 

1 1 1 1 Mill! 1 1 1 ! 1 1 ; : 1 1 1 1 1 I i - I < ) 1 1 1 1 1 1 1 i 1 1 II Ml IIIIII I 

Db 181 TCTCTAATTATTCAGAAATAG CAAAATTTGAGAAGTCGGTGCCTGATCACTGTGAGCTCG 24 0 

Qy 23 87 TGGAGGATTCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTG 244 6 

MM MM IMM MIMI IMIIIMI MIMMMMMM MM 

Db 241 TGGATGATTCCTCACCCGAATCTGAACCAGTTGACTTATTTAGTGATGATTCAATTCCTG 3 00 

Qy 2447 AAGTCCCACAAACACAAGAGGAGGCTGTGATGCTCATGAAGGAGAGTCTCACTGAAGTGT 2506 

IMM M MIMI IMIIIMI Mill 1 1 M 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 

Db 3 01 AAGTCCC^O^CACAAGAGGAGGCTGTGATGCTAATGAAGGAGAGTCTCACTGAAGTGT 3 60 

Qy 2507 CTGAGACAGTAG C C CAG CACAAA - - - GAGGAGAGACTTAGTG CCTCAC CTCAGGAG CTAG 2563 



Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



III I II MINI I I II I 'I I II MM III 

3 61 CTGAGACAGTAACACAAC^CAAACATAAGGAGAGACTTAGTGCTTC^ 42 0 
2564 GAAAGCCATATTTAGAGTCTTTTCA.GCCCAATTTACATAGTACAAAAGATGCTGCATCTA 2623 

I II 1 1 1 1 1 II ; 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 II llllllll II IMIIIMM 

421 GAAAGCCATATTTAGAGTCTTTTCAGCCCAATTTACATATTACAAAAGATGCTGCATCTA 480 

2624 ATGACATTCCAACATTGACCAAAAAGGAGAAAATTTCTTTGCAAATGGAAGAGTTTAATA 2683 

I I I I I I II III II I II I I IM I I! II II llllllll II IMMMIMIIIIII I 

4 81 ATGAAATTCCAAC^TTGACCAAAAAGGAGAC^TTTCTTTGCAAATGGAAGAGTTTAATA 54 0 

2684 CTGCAATTTATTCAAATGATGACTTACTTTCTTCTAAGGAAGACAAAATAAAAGAAAGTG 2743 



: I M I 1 I I I I I I II 



II IIIIIIMI IIIIIMI 



IIIIMI 



541 CTGCAATTTATTCCAATGATGACTTACTTTCTTCTAAGGAAGACAAAATGAAAGAAAGTG 600 
2744 AAACATTTTCAGATTCATCTCCGATTGAGATAATAGATGAATTTCCCACGTTTGTCAGTG 28 03 

I 1 1 ! I ! : 1 1 1 1 : - 1 1 1 1 1 ! IIIIIIMI MM llllllll IIIIIIMII 

6 01 AAACATTTTCCGATTCATCTCTCATTGAGATAATAGATGAGTTTCCCACATTTGTCAGTG 660 
28 04 CTAAAGATGATTCTCCTAAATTAGCCAAGGAGTACACTGATCTAGAAGTATCCGACAAAA 28 63 

. I ! > 1 1 1 1 1 1 1 1 1 1 1 MIMM MM llllllllllll MINI 

661 CTAAAGATGATTCTCCT AAGGAGTACACTGAC CTAGAAGTATCCAACAAAA 711 

28 64 GTGAAATTGCTAATATCCAAAGCGGGGCAGATTCATTGCCTTGCTTAGAATTGCCCTGTG 2 923 

I I ; 1 1 1 1 1 - 1 1 : 1 1 1 1 Mill I 1 1 1 1 IIIIIIMII MM IIIIMM 

712 GTGAAATTGCTAATGTCCAGAGCGGNGGCAATTCGTTGCCTTGCTCAGAATTGCCCTGTG 771 

2 924 ACCTTT 2 92 9 

llllll 
772 ACCTTT 777 



RESULT 12 

CA320635 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



EST 26-NOV-2002 
cDNA clone 



Craniata; Vertebrata; Euteleostomi ; 
Sciurognathi ; Muridae; Murinae; Mus . 



CA320635 802 bp mRNA linear 

UI-M-FW0-CCb-O-24-0-UI .rl NIH_BMAP_FW0 Mus musculus 
IMAGE: 6817489 5*, mRNA sequence. 
CA320635 

CA320635. 1 GI : 2453 8 75 9 
EST . 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Rodent ia; 
1 (bases 1 to 802) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Dr. Jim Lin, University of Iowa 
cDNA Library preparation: Dr. M. Bento Soares, University of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa 
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa 
Clone Distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consort ium/LLNL at: 

http : // image . llnl . gov 



This clone was contributed by the Brain Molecular Anatomy Project 
(BMAP) 

Seq primer: pYX-5. 
FEATURES Locat ion/ Qual i f iers 

source 1. .8 02 

/organism="Mus musculus" 
/ mo 1 _t yp e = " mRNA " 
/strain="C57BL/6" 
/db_xref="taxon: 10090" 
/clone- "IMAGE: 681748 9" 
/tissue_type= "whole brain" 

/dev_stage=" embryo 13.5,14.5,16.5,17. 5dpc" 
/lab_host="DH10B (Tl phage resistant)" 
/clone_l ib= "NIH_BMAP_FW0 " 

/note="0rgan: Brain; Vector: pYX- Asc; Site_l: EcoR I; 
Site_2: Not I; The library was constructed according 
Bonaldo, Lennon and Soares, Genome Research, 6:791-8 06, 
1996. Denatured RNA was size fractionated on a 1% agarose 
gel. First strand cDNA synthesis was primed with oligo-dT 
primer containing a Not I site. Double strand cDNA was 
size selected according to mRNA size fraction, ligated 
with EcoR I adaptor, digested with NotI and then cloned 
directionally into pYX-Asc vector. The library tag 
sequence located between the Not I site and the polyA tail 
is AGGGAGACAG . This library was created for the University 
Iowa Brain Anatomy Project (BMAP) : 'Gene Discovery in the 
Developing Mouse Nervous System' , supported by National 
Institute of Mental Health (NIMH) , Hemin Chin, Ph.D., 
program coordinator." 

BASE COUNT 275 a 157 c 154 g 213 t 3 others 

ORIGIN 

Query Match 17.7%; Score 660.4; DB 14; Length 8 02; 

Best Local Similarity 91.6%; Pred. No. 2.7e-94; 

Matches 745; Conservative 0; Mismatches 54; Indels 14; Gaps 4; 



Qy 2148 AAACCCCCCACCATATGAAGAAGCCATGAATGTAGCACTAAAAGCTTTGGGAACAAAGGA 22 07 

III I IIIMI IMIMII MM IMIMIIIIIII I I II lllllll 

Db 1 AAAATCCCCACCATATGAAGAAGCCATGAGTGTAGCACTA 60 

Qy 22 08 AGGAATAAAAGAGC CTGAAAGTTTTAATGCAG CTGTT CAGGAAACAGAAGCTC CTTATAT 2267 

II III MIMMIMM MMMMMIMM lllllll MINIMI III II I 

Db 61 AGAAATTAAAGAGCCTGAAAGTTTTAATGCAGCTGCTCAGGAAGCAGAAGCTCCTTATAT 12 0 

Qy 2268 ATCCATTGCGTGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGCCAAGTCCAGATTT 2327 

IMIMII 1 1 1 1 1 1 1 1 1 1 1 1 i i 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 !1 1 1 1 1 1 1 j I £ 1 1 1 1 1 1 1 1 II 

Db 121 ATCCATTGCATGTGATTTAATTAAAGAAACAAAGC 18 0 

Qy 2328 CTCTAATTATTCAGAAATAGCAAAATTCGAGAAGTCGGTGCCCGAACACGCTGAGCTAGT 2387 

M! M MMM MMMMMM 1 1 1 II 1 1 1 1 1 1 1 1 1 II III llllll II 

Db 181 CTCTAATTATTCAGAAATAGCAAAATTTGAGAAGTCGGTGCCTGATCACTGTGAGCTCGT 24 0 

Qy 23 8 8 GGAGGATTCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGA 244 7 

III Mlllllllll MM MMM M MMMMMMMMM! IIMMII 

Db 241 GGATGATTCCTCACCCGAATCTGAACCAGTTGACTTATTTAGTGATGATTCAATTCCTGA 3 00 

Qy 2448 AGTCCCAQ^AACACAAGAGGAGGCTGTGATGCTCATGAAGGAGAGTCTCACTGAAGTGTC 2507 



I MINI MINIMI MINI II III 1 1 1 1 1 1 1 II 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 

Db 3 01 AGTCCCACAAACACAAGAGGAGGCTGTGATGCTAATGAAGGAGAGTCTCACTGAAGTGTC 360 

Qy 2 508 TGAGACAGTAGCC CAG CACAAA GAGGAGAGACTTAGTGCCTCACCTCAGGAGCTAGG 2564 

IIIMIIIII I I I M I I I I I ' : I I I I I I I IIIIIIIMIM | | | | 

Db 361 TGAGACAGTAACACAACACAAACATAAGGAGAGACTTAGTGCTTCACCTCAGGAGGTAGG 42 0 

Qy 2565 AAAGCCATATTTAGAGTCTTTTCAGCCCAATTTACATAGTACAAAAGATGCTGCATCTAA 2624 

I MIMI II llllll MMMMMNMM MMMMIMM! IMM 

Db 421 AAAGCCATATTTAGAGTCTTTTCAG C C CAATTTACATATTACAAAAGATG CTG CATCTAA 48 0 

Qy 2625 TGACATTCC^CATTGACCAAAAAGGAGAAAATTTCTTTGCAAATGGAAGAGTTTAATAC 2684 

III IMMM MIMI MUM I Mill MMMMMMIMM MM: 

Db 481 TGAAATT C CAACATTGACCAAAAAGGAGACAATTTCTTTG CAAATGGAAGAGTTTAATAC 54 0 

Qy 2685 TGCAATTTATTCAAATGATGACTTACTTTCTTCTAAGGAAGACAAAATAAAAGAAAGTGA 2744 

II MIMI I IMM MIMMIMMMM IIIIIIIMM III Mill 

Db 541 TGCAATTTATTCCAATGATGACTTACTTTCTTCTAAGGAAGACAAAATGAAAGAAAGTGA 600 

Qy 2745 AACATTTTCAGATTCATCTCCGATTGAGATAATAGATGAATTTCCCACGTTTGTCAGTGC 28 04 

MINIMI I IMIMMIIMIMM MM 1 1 1 IN 

Db 601 AACATTTTCCGATTCATCTCNCATTGAGATAATAGATGAGTTTCNCACATTTGTCAGTGC 660 

Qy 28 05 TAAAGATGATTCTCCTAAATTAGCCAAGGAGTACACTGATCTAGAAGTATCCGACAAAAG 2864 

M IMMMII III 1 1 1 1 j I E 1 1 1 1 1 1 1 llllllllllll III 1 1 1 

Db 661 TAAAGATGATTCTCCT AAGGAGTACACTGACCTAGAAGTATCCAACACAAG 711 

Qy 2865 TGAAATTG CTAATAT CCAAAGCGGGG CAGATTCATTG CCTTG CTTAGAATTGCC CTGTGA 2924 

1 1 1 1 ; 1 1 i 1 1 1 1 1 1 1 MINIM MM I IMMM III II! MIMI! 

Db 712 TGAAATTGCTAATGTCCAGAGCGGGGCCAATTCGTTGCCTTGCTCAGATTTG - CCTGTGA 770 

Qy 2 925 CCTTTCTTTCAAGAATATATATCCTAAAGATGA 2 957 

M IMMM I Ml ! IMMMII m 

Db 771 CCTTTCTTTCANG - ATACATATCCTAAAGATGA 8 02 



RESULT 13 

BQ892001 

LOCUS 

DEFINITION 

ACCESSION 
VERSION. 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



linear EST 16-AUG-2002 
CDNA clone IMAGE : 6315079 



BQ892001 951 bp mRNA 

AGENCOURT_8 75834 7 NIH_MGC_129 Mus musculus 
5 1 , mRNA sequence . 
BQ892001 

BQ892001.1 GI:22284015 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Rodent ia; 
1 (bases 1 to 951) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 

Email : cgapbs-r@mail .nih.gov 

Tissue Procurement: Susan L. Sullivan, PhD. 
cDNA Library Preparation: ResGen, Invitrogen Corp 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 



Craniata; Vertebrata; Euteleostomi ; 
Sciurognathi ; Muridae; Murinae; Mus. 



DNA Sequencing by: Agencourt Bioscience Corporation 

Clone distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consort ium/LLNL at: 

http : //image . llnl . gov 

Plate: LLAM13744 row: n column: 08 
High quality sequence start: 6 
High quality sequence stop: 629. 
FEATURES Location/Qualifiers 
source 1. .951 

/organism= f, Mus musculus" 

/ mo 1 _ t yp e = " mRNA " 

/db_xref ="taxon: 10090" 

/clone=" IMAGE: 6315079" 

/lab_host="DH10B (phage-resistant) " 

/ c 1 one_l ib= " N I H _MGC_1 2 9 " 

/note= "Organ : olfactory epithelium; Vector: 
pCMV-SPORT6.1.ccdb; Site_l : EcoRV; Site_2 : Not I; Cloned 
unidirectionally . Primer: Oligo dT. Average insert size 
2.2 kb. Constructed by ResGen, Invitrogen Corp. Note: this 
is a NIH_MGC Library." 

BASE COUNT 279 a 186 c 222 g 263 t 1 others 

ORIGIN 

Query Match 17.6%; Score 659.2; DB 13; Length 951; 

Best Local Similarity 89.3%; Pred. No. 4.1e-94; 

Matches 780; Conservative 0; Mismatches 79; Indels 14; Gaps 6; 
Qy 264 6 AAAGGAGAAAATTT CTTTG CAAATGGAAGAGTTTAATACTG CAATTTATTCAAATGATGA 2705 

Ml I MIIIIIIIIIMIMIMIIIIMMIIIIIMIIIMIMI III III 

Db 11 AAAAGGAGAAATTTCTTTGCAAATGGAAGAGTTTAATACTGCAATTTATTCCAATGATGA 70 

Qy 2706 CTTACTTTCTTCTAAGGAAGACAAAATAAAAGAAAGTGAAACATTTTCAGATTCATCTCC 2765 

I I I I : I I I I I I I I I I . I 1 1 1 1 1 1 1 M llllll IIIIIIIM Mill III 

Db 71 CTTACTTTCTTCTAAGGAAGACAAAATGAAAGAAAGTGAAACATTTTCCGATTCATCTCC 13 0 

Qy 2766 GATTGAGATAATAGATGAATTTCCCACGTTTGTCAGTGCTAAAGATGATTCTCCTAAATT 2825 

II II MINI III llllll 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 E 1 1 1 ! i 1 1 1 [ 

Db 131 CATTGAGATAATAGATGAGTTTCCCACATTTGTCAGTGCTAAAGATGATTCTCCT 18 5 

Qy 2826 AGCCAAGGAGTACACTGATCTAGAAGTATCCGACAAAAGTGAAATTGCTAATATCCAAAG 28 85 

IIIMIIIIIIIII M llllll I M : 1 1 1 M I III 1 1 , 1 ; I 1 1 1 1 II 

Db 18 6 AAGGAGTACA.CTGACCTAGAAGTATCCAACAAAAGTGAAATTGCTAATGTCCAGAG 241 

Qy 288 6 CGGGGCAGATTCATTGCCTTGCTTAGAATTGCCCTGTGACCTTTCTTTCAAGAATATATA 2 94 5 

llllll 1 1 1 1 llllllllll INI llilllMhllllll llllll II, III 

Db 242 CGGGGCCAATTCGTTGCCTTGCTCAGAATTGCCCTGTGACCTTTCTTTCAAGAATACATA 3 01 

Qy 2946 TCCTAAAGATGAAGTACATGTTTCAGATGAATTCTCCGAAAATAGGTCCAGTGTATCTAA 3 005 

IIIMIIIIIIIII llllll MM MINIMI III 1 1 1 1 ! 1 1 1 1 1 1 1 1 M I 

Db 3 02 TCCTAAAGATGAAGCA.CATGTCT(^GATGAATTCTCCAAAAGTAGGTCCAGTGTATCTAA 361 

Qy 3006 GGCATCCATATOSCCTTCAAATGTCTCTC 3 065 

II II III II I llllll! IIIIIIIM I I I I I lllllllllll III 

Db 362 GGTGCCCTTATTGCTTCCAAATGTTTCTGCTTTGGAATCTCAAATAGAAATGGGCAACAT 421 

Qy 3 066 AGTTAAATCCAAATCACTTACGAAAGAAGCAGAGAAAAAACTTCCTTCTGAC^C^GAGAA 3125 

MM Mill IIIMIIIIIIIII 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 MM 



Db 



422 AGTTAAACCCAAAGTACTTACGAAAGAAGCAGAGGAAAAACTTCCTTCTGATACAGAGAA 4 81 



Qy 312 6 AGAGGACAGATGCCTGTCAGCTGTATTGTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGA 318 5 

IIIIIIIIIIIIIIII Illlllllllllllllllllllll llllllllllllllllll 

Db 482 AGAGGACAGATCCCTGACAG CTGTATTGT CAG CAGAG CTGAATAAAACTT CAGTTGTTGA 541 

Qy 3186 CCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTATTCCT 324 5 

MINI MM IIIIIIIIIIIII II MMII MIMMMMIMM II Ml 

Db 54 2 CCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTATTCCT 601 

Qy 324 6 GCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACATTG - CCTTGGCCC 33 04 

MM MMII MMIMMMIMM MUM MIIMM M lllllllll 

Db 602 GCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACATTGCCCTTGGCCC 661 

Qy 3305 TGCTCTCGGTGACTATCAGCTTTAGGATATATAAGGGCGTGATCCAGGCTATCCAGAAAT 3364 

MM; III II IIIIIIIIIIIIIIII Mllllll IIIIIIIIIIIII 

Db 662 TGCTCTCTGTGACTATNCAGCTTAGGATATATAAGGGTGTGATCCAAGCTATGCAGAAAT 721 

Qy 3365 CAGATGAAGGCCA - CCCATTCAGGGCATATTTAGAATCTGAAGTTGCTATATCAGAGGAA 3423 

I II I Ml I IN 1 1 III llllll III Mill I llllllllllll Mllllll Ml 

Db 722 CAGATGAAGGCCACCCCTTTCAGGGCATATTTGGGATCTGAAGTTGCCATATCAGAAGAA 781 

Qy 3424 TTGG - TTCAGAAATACA - GTAATTCTGCTCTTGGTCATGTGAACAG - CACAATAAAAGAA 348 0 

MM MIIIIMM I I IMM M Ml MM I M I MMII II II 

Db 782 TTGGTTTCAGAAATATAGGAAATTCTGCTCTTGGGCATGGGGACCGCCACAATAAAAGAA 841 

Qy 34 81 CTGAGGCGGCTTTTCTTAGTTGATGATTTAGTT 3513 

Db 842 ATGGAGGGGGTCTCCTCCTAAGTTAATGGATTT 874 



RESULT 14 

BU612951 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



Craniata; Vertebrata; Euteleostomi ; 
Sciurognathi ; Muridae; Murinae; Mus . 



BU612951 739 bp mRNA linear EST 20-FEB-2003 

UI-M-FR0-cbd-a-04-0-UI . rl N I H_BMAP_FR 0 Mus tnusculus cDNA clone 
UI-M-FR0-cbd-a-04-0-UI 5', mRNA sequence. 
BU612951 

BU612951.1 GI:23279166 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata ; 
Mammalia; Eutheria; Rodent ia; 
1 (bases 1 to 739) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Dr. Jim Lin, University of Iowa 
cDNA Library preparation: Dr. M. Bento Soares, University of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa 
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa 
Clone Distribution: Clone distribution information can be obtained 

from Dr. M. Bento Soares, bento-soares@uiowa.edu 
This clone was contributed by the Brain Molecular Anatomy Project 

(BMAP) 



FEATURES 

source 



BASE COUNT 
ORIGIN 



Seq primer: pYX-5. 

Location/Qualif iers 
1. -739 

/organism= n Mus musculus" 
/mol_type= ,, mRNA" 
/strain= !f C57BL/6 ff 
/db_xref ="taxon: 10090" 
/clone= M UI-M-FR0-cbd-a~04-0-UI " 
/tissue_type= "whole brain" 

/dev_stage=" embryo 13 . 5 , 14 . 5 , 16 . 5 , 17 . 5dpc" 
/lab_host="DH10B (Tl phage resistant)" 
/clone_l ib= "NIH_BMAP_FR0 " 

/not e=" Organ: Brain; Vector: pYX- Asc; Site_l: EcoR I; 
Site_2 : Not I; The library was constructed according 
Bonaldo, Lennon and Soares, Genome Research, 6:791-806, 
1996. Denatured RNA was size fractionated on a 1% agarose 
gel. First strand cDNA synthesis was primed with oligo-dT 
primer containing a Not I site. Double strand cDNA was 
size selected according to mRNA size fraction, ligated 
with EcoR I adaptor, digested with NotI and then cloned 
directionally into pYX-Asc vector. The library tag 
sequence located between the Not I site and the polyA tail 
is AGCGAGACAG. This library was created for the University 
Iowa Brain Anatomy Project (BMAP) : 'Gene Discovery in the 
Developing Mouse Nervous System', supported by National 
Institute of Mental Health (NIMH), Hemin Chin, Ph.D., 
program coordinator." 
224 a 148 c 162 g 205 t 



Query Match 17.3%; Score 648.6; DB 13; Length 739; 

Best Local Similarity 92.9%; Pred. No. 2e-92; 

Matches 694; Conservative 0; Mismatches 44; Indels 9; 



Gaps 



1; 



Qy 2721 GGAAGAOWy\TAAAAGAAAGTGAAA(^TTTTC^GATTCATCTCCGATTGAGATAATAGA 278 0 

MINIMI II M IMIIMM Mill MINIMI I 1 1 1 1 M I M II 1 1 

Db 2 GGAAGACAAAATGAAAGAAAGTGAAACATTTTCCGATTCATCTCCCATTGAGATAATAGA 61 



Qy 2 781 TGAATTTCCCACGTTTGTCAGTGCTAAAGATGATTCTCCTAAATTAGCCAAGGAGTACAC 284 0 

III MUM Ml Ml 1 1 1 1 1 ; 1 1 1 : : I 1 1 1 1 1 1 lllllllllll 

Db 62 TGAGTTTCCCACATTTGTCAGTGCTAAAGATGATTCTCCT AAGGAGTACAC 112 

Qy 2841 TGATCTAGAAGTATC CGACAAAAGTGAAATTGCTAATATC CAAAGCGGGG CAGATT CATT 2 900 

III IIIIIIIIMII Illlllll INNI III Ml MINIM Ml I! 

Db 113 TGACCTAGAAGTATCCAACAAAAGTGAAATTGCTAATC 172 

Qy 2 901 GCCTTGCTTAGAATTGCCCTGTGACCTTTCTTTCAAGAATATATATCCTAAAGATGAAGT 2960 

Illlllll : 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i M II! 1 1 1 1 I 1 1 1 1 1 1 : 1 1 1 1 ; I I 

Db 173 GCCTTGCTCAGAATTGCCCTGTGACCTTTCTTTCAAGAATACATATCCTAAAGATGAAGC 232 

Qy 2 961 ACATGTTTCAGATGAATTCTCCGAAAATAGGTCCAGTGTATCTAAGGCATCCATATCGCC 302 0 

INN 1 1 1 1 1 1 II II I M M Ml i 1 1 1 1 1 1 1 1 1 i 1 1 f 1 1 1 1 E I II Ml II 

Db 233 ACATGTCTCAGATGAATTCTCCAAAAGTAGGTCCAGTGTATCTAAGGTGCCCTTATTGCT 292 

Qy 3 021 TTCAAATGTCTCTGCTTTGGAACCTCAGACAGAAATGGG 308 0 

I III 1 1 II Mill Ml II 1 1 MM I I III 1 1 MM Illlllll II MM 

Db 2 93 TCCAAATGTTTCTGCTTTGGAATCTCAAATAGAAATGGGCAACATAGTTAAACCCAAAGT 352 



Qy 3081 ACTTACGAAAGAAGCAGAGAAAAAACTTCCTTCTGACACAGAGAAAGAGGACAGATCCCT 3140 

lllllllilllllllll II III IIIIMIIIIII II 1 1 1 Ml 1 1 II III I II II 1 1 1 

Db 3 53 ACTTACGAAAGAAG CAGAGGAAAAACTT CCTTCTGATACAGAGAAAGAGGACAGATC CCT 412 

Qy 3141 GTCAGCTGTATTGTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAG 32 00 

I 1 1 1 1 1 1 1 ! I ! 1 1 1 1 : 1 1 1 1 1 ; I 1 1 1 1 1 1 1 1 1 [ I 1 1 1 1 1 1 i I . MINIM 

Db 413 GACAGCTGTATTGTCAGCAGAGCTGAATAAAACTTCAGTTGTTGACCTCCTGTACTGGAG 472 

Qy 32 01 AGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGAC 3260 

I Ml MM I MINI 1 1 Mill I III III Mill II III II III II III MINI Mill 

Db 4 73 AGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGAC 532 

Qy 3261 AGTGTTCAGCATTGTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGACTAT 332 0 

1 1 1 1 [ 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 i ! 1 1 M I ! 1 1 1 1 i 1 1 1 1 1 1 Mill I 

Db 533 AGTGTTCAGCATTGTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTAT 592 

Qy 3321 CAGCTTTAGGATATATAAGGGCGTGATCCAGGCTATCCAGAAATCAGATGAAGGCCACCC 338 0 

I I 1 1 1 i 1 1 : 1 Mill. 1 1 1 1 III! IM M IIIIIIIIMIIMl IIIIIMI 

Db 593 CAGCTTTAGGATATATAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCC 652 

Qy 3381 ATTCAGGG CATATTTAGAAT CTGAAGTTG CTATATCAGAGGAATTGGTTCAGAAATACAG 3440 

IMMMMIMIM II I 1 1 ! 1 1 1 : 1 ; 1 1 1 1 1 1 1 1 I ! 1 1 4 1 M 

Db 653 ATTCAGGG CATATTTGGAAT CTGAAGTTG CCATATCAGAGGAATTGGTTCAGAAATATAG 712 

Qy 3441 TAATTCTGCTCTTGGTCATGTGAACAG 3467 

Illllll MMMMIMM 

Db 713 TAATTCTGCTCTTGGTCATGTGAACAG 73 9 



RESULT 15 

CA320833 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



CA320833 742 bp mRNA linear EST 26-NOV-2002 

UI-M-FW0-ccb-n-23-0-UI . rl NIH_BMAP_FW0 Mus musculus cDNA clone 
IMAGE: 6817464 5', mRNA sequence. 
CA320833 

CA320833 . 1 GI :24538931 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (bases 1 to 742) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Dr. Jim Lin, University of Iowa 
cDNA Library preparation: Dr. M. Bento Soares, University of Iowa 
cDNA Library Arrayed by: Dr. M . Bento Soares, University of Iowa 
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa 
Clone Distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consort ium/LLNL at: 

ht tp : / / image . 1 lnl . gov 
This clone was contributed by the Brain Molecular Anatomy Project 

(BMAP) 



FEATURES 

source 



BASE COUNT 
ORIGIN 



Seq primer: pYX-5. 

Location/Qualifiers 
1. .742 

/ organ ism= n Mus mus cuius" 
/ mo 1 _ t yp e = " mRNA " 
/strain="C57BL/6" 
/db_xref="taxon: 10090" 
/clone= " IMAGE : 68174 64" 
/tissue_type= "whole brain" 

/dev_stage= " embryo 13.5,14.5,16.5,17. 5dpc " 
/lab__host = "DH10B (Tl phage resistant)" 
/clone_l ib= "NIH_BMAP_FW0 " 

/note==" Organ: Brain; Vector: pYX- Asc; Site_l: EcoR I; 
Site_2: Not I; The library was constructed according 
Bonaldo, Lennon and Soares, Genome Research, 6:791-806, 
1996. Denatured RNA was size fractionated on a 1% agarose 
gel. First strand cDNA synthesis was primed with oligo-dT 
primer containing a Not I site. Double strand cDNA was 
size selected according to mRNA size fraction, ligated 
with EcoR I adaptor, digested with NotI and then cloned 
directionally into pYX-Asc vector. The library tag 
sequence located between the Not I site and the polyA tail 
is AGCGAGACAG . This library was created for the University 
Iowa Brain Anatomy Project (BMAP) : 'Gene Discovery in the 
Developing Mouse Nervous System 1 , supported by National 
Institute of Mental Health (NIMH), Hemin Chin, Ph.D., 
program coordinator." 
227 a 149 c 154 g 212 t 



Query Match 17 . 13 

Best Local Similarity 92.63 
Matches 696; Conservative 



Score 638.4; DB 14; Length 742; 
Pred. No. 8.1e-91; 
0; Mismatches 46; Indels 10; Gaps 



2; 



Qy 2644 AAAAAGGAGAAAATTT CTTTG CAAATGGAAGAGTTTAATA CTG CAATTTATT CAAATGAT 2703 

II Mill Mill MIMI MUM II MMM II MMM I MINI 

Db 1 AATCTGGAGACAATTTCTTTGCAAATGGAAGAGTTTAATACTGCAATTTATTCCAATGAT 6 0 



Qy 2704 GACTTACTTTCTTCTAAGGAAGAC^UUUVTAAAAGAAAGTGAAAC^TTTTCAGATTCATCT 2 763 

Mill MIMIMMMMM III II IMIII il MM II I II I II I 

Db 61 GACTTACTTTCTTCTAAGGAAGACAAAATGAAAGAAAGTGAAACATTTTCCGATTCATCT 12 0 

Qy 2764 CCGATTGAGATAATAGATGAATTTCCCACGTTTGTCAGTGCTAAAGATGATTCTCCTAAA 2 823 

II Ml IMMMIMM II Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 

Db 121 CCCATTGAGATAATAGATGAGTTTCCCACATTTGTCAGTGCTAAAGATGATTCTCCT 177 

Qy 2824 TTAG C CAAGGAGTACACTGAT CTAGAAGTATCCGACAAAAGTGAAATTGCTAATATCCAA 2 883 

Mill 1 1 MM II I I II I Ml II I II MM II I Mill Ml II 111 I II I 

Db 178 AAGGAGTACACTGACCTAGAAGTATCCAACAAAAGTGAAATTGCTAATGTCCAG 231 

Qy 2884 AGCGGGGCAGATTCATTGCCTTGCTTAGAATTGCCCTGTGACCTTTCTTTCAAGAATATA 2 943 

MINI 1 1 1 1 MINIMI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 I 

Db 232 AGCGGGGCCAATTCGTTGCCTTGCTCAGAATTGCCCTGTGACCTTTCTTTCAAGAATACA 2 91 

Qy 2 944 TATCCTAAAGATGAAGTACATGTTTCAGATGAATTCTCCGAAAATAGGTCCAGTGTATCT 3 003 

MMMIMMIMM III : 1 1 1 i 1 1 . 1 1 1 1 1 1 Ml IMMIIMI Ml 

Db 2 92 TATCCTAAAGATGAAGCAt^TGTCTC^GATGAATTCTCCAAAAGTAGGTCCAGTGTATCT 351 



Qy 3004 AAGGCATCCATATCGCCTTCAAATGTCTCTGCTTTGGAACCTCAGACAGAAATGGGCAGC 3 063 

1 1 1 1 II III II I lllllll MINIMUM MM I MINIMI I 

Db 352 AAGGTGCCCTTATTGCTTCCAAATGTTTCTGCTTTGGAATCTCAAATAGAAATGGGCAAC 411 

Qy 3 064 ATAGTTAAATCCAAAT CACTTACGAAAGAAG CAGAGAAAAAACTT CCTTCTGACACAGAG 3123 

MINI I Mill Mill MINI MM 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 III 

Db 412 ATAGTTAAACCCAAAGTACTTACGAAAGAAGCAGAGGAAAAACTTCCTTCTGATACAGAG 471 

Qy 3124 AAAGAGGACAGATCCCTGTCAGCTGTATTGTCAGCAGAGCTGAGTAAAACTTCAGTTGTT 3183 

Mill IMIMIII ! 1 1 ! ! 1 1 1 1 1 1 1 1 1 1 1 1 i I MIMIIMI II I 

Db 472 AAAGAGGACAGATCCCTGACAGCTGTATTGTCAGCAGAGCTGAATAAAACTTCAGTTGTT 531 

Qy 3184 GACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTATTC 324 3 

Mill MMM llllll MMM II MMM MIMIMMMMMM 

Db 532 GACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTATTC 591 

Qy 3244 CTGCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACATTGCCTTGGCC 3303 

I ! I i I I I I I i I I I I I i I I I I I I I I I I I I I . I I I : I I i I I I I I I I ! I I I I : ' I : I I 

Db 592 CTGCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACATTGCCTTGGCC 651 

Qy 3304 CTGCTCTCGGTGACTATCAGCTTTAGGATATATAAGGGCGTGATCCAGGCTATCCAGAAA 3363 

MIMM llllll llllll IIIMIMI Ml MMM I llllll M 

Db 652 CTGCT CT CTGTGACTATCAG CTTTAGGATATATAAGGGTGTGAT C CAAGCTATC CAGAAA 711 

Qy 33 64 T CAGATGAAGG C CAC C CATT CAGGG CATATTT 3395 

I III M MM : MMM M MM 

Db 712 TCAGATGAAGGCCA- C CATT CAGGG CATATTT 742 



Search completed: January 23, 2004, 14:46:28 
Job time : 7246.48 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched : 



January 23, 2004, 00:42:43 ; Search time 896.477 Seconds 

(without alignments) 
11264.762 Million cell updates/sec 

US-09-830-972-1 
3741 

1 attgctcgtctgggcggcgg gattgaagcgcaaagcagat 3741 

I DENT I TY_NUC 

Gapop 10.0 , Gapext 1.0 

2552756 seqs, 1349719017 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



5105512 



Database : 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 



>eneseq_19 Jun03 : * 

/SIDSl/gcgdata/geneseq/geneseqn-embl/NA198 
/S I DSl/gcgdat a/genes eq/geneseqn-embl /NA198 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA198 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA198 
SIDSl/qcqdata/qeneseq/qeneseqn-embl/NA198 



/SIDSl/gcgdata/geneseq/geneseqn-embl/NA198 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA198 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA198 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA198 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA198 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA198 
/SIDSl/gcgdata/geneseq/gene " " ' ~ 



/SIDS 
/SIDS 
/SIDS 



/NA198 

, _ ^ ^ /NA198 

!l/gcgdata/geneseq/geneseqn-embl/NA19 
;i/gcgdata/geneseq/ geneseqn- embl /NA19 
, ^wJl/gcgdata/geneseq/geneseqn-embl/NA19 
/SIDSl/gcgdata/geneseq/geneseqn-embl /NA19 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA19 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA19 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA19. 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA19! 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA19! 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA19! 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA19! 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA20 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA20 
/ ~IDSl/gcgdata/geneseq/geneseqn-embl/NA2 0 
!IDSl/gcgdata/geneseq/geneseqn-embl/NA2 0 
!IDSl/gcgdata/geneseq/geneseqn-embl/NA2 0 



/S 
/S 

/s 



0 . DAT : * 

1 . DAT : * 

2 . DAT : * 

3 . DAT : * 

4 . DAT : * 

5 . DAT : * 

6 . DAT : * 

7 . DAT : * 

8 . DAT : * 
8 9 . DAT : * 

90. DAT:* 

91. DAT:* 

92 . DAT: * 

93 . DAT: * 

94 . DAT: * 

95 . DAT: * 

96 . DAT: * 

97 . DAT: * 

98 . DAT: * 

99. DAT:* 
00. DAT:* 
0 1 A . DAT : * 
01B . DAT : * 

02 . DAT: * 

03 . DAT: * 



Pred. No. is the number of results predicted by chance to have a 



score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

o. 

Result Query 



NO. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


3739 , 


.4 


100 , 


. 0 


4684 


21 


AAD01173 


Rat neurite growth 


2 


3739, 


.4 


100 . 


. 0 


4684 


24 


ABN86600 


Rat neurotransmitt 


3 


2343. 


.6 


62 , 


. 6 


4053 


22 


AAS09453 


Human cDNA encodin 


4 


2343 . 


.6 


62 . 


. 6 


4632 


24 


ABV94680 


Human pancreatic c 


5 


2333 , 


.2 


62 , 


. 4 


4093 


21 


AAA23454 


cDNA encoding huma 


6 


2323 . 


.8 


62 . 


. 1 


4822 


24 


ABS70449 


Human bone remodel 


7 


2297, 


.4 


61 , 


. 4 


4698 


25 


ABX34563 


Human mddt cDNA SE 


8 


2289, 


.2 


61 . 


. 2 


3579 


21 


AAZ56886 


Human MAGI polypep 


9 


2289, 


.2 


61 , 


. 2 


3579 


22 


AAF90324 


Human NOGO-A cDNA. 


10 


2289 , 


.2 


61 . 


. 2 


3579 


24 


ABN86601 


Human neurotransmi 


11 


2289 , 


.2 


61 . 


. 2 


3579 


24 


ABK90134 


DNA encoding human 


12 


1869 , 


.8 


50 , 


. 0 


3833 


21 


AAD01174 


Bovine neurite gro 


13 


1411, 


.2 


37 , 


. 7 


2386 


19 


AAV30920 


Human secreted pro 


14 


1408 


37 . 


. 6 


2386 


22 


AAF98399 


Human cDNA clone B 


15 


1088 , 


.8 


2 9 . 


. 1 


1980 


22 


AAI98079 


Human neuroblast otn 


16 


564 


15 . 


. 1 


1568 


21 


AAD01175 


Rat neurite growth 


17 


503 , 


.2 


13 . 


. 5 


1758 


22 


AAF32725 


Human secreted pro 


18 


497 , 


.4 


13 , 


. 3 


600 


22 


AAF90323 


Human NOGO-C cDNA. 


19 


497. 


.4 


13 , 


. 3 


770 


21 


AAA72983 


Human NSPH encodin 


20 


497 . 


.4 


13 . 


. 3 


799 


19 


AAV23695 


Human NSPLP protei 


21 


497 , 


.4 


13 . 


. 3 


1122 


21 


AAZ56888 


Human MAGI polypep 


22 


497 , 


.4 


13 , 


. 3 


1122 


22 


AAF90325 


Human NOGO-B cDNA. 


23 


497 , 


.4 


13 . 


. 3 


1213 


20 


AAX04379 


Human secreted pro 


24 


497 , 


.4 


13 . 


. 3 


1216 


24 


ABA05903 


Human RTN4B encodi 


25 


497 , 


.4 


13 . 


. 3 


2052 


24 


ABK90133 


DNA encoding human 


26 


497, 


.4 


13 . 


. 3 


2235 


24 


ABV94681 


Human pancreatic c 


27 


497 , 


.4 


13 , 


. 3 


2240 


21 


AAC64406 


Human Nogo B nucle 


28 


495, 


.8 


13 . 


. 3 


991 


20 


AAX97587 


Extended human sec 


29 


495 , 


.8 


13 . 


. 3 


1610 


21 


AAZ36230 


cDNA encoding a bo 


30 


495, 


.8 


13 , 


. 3 


1694 


22 


AAK94408 


Human full-length 


31 


483 . 


.6 


12 , 


.9 


868 


21 


AAZ56887 


Human MAGI polypep 


32 


475 


12 . 


.7 


1798 


24 


ABK90135 


DNA encoding human 


33 


468 


12 . 


.5 


1514 


24 


ABK34580 


Human cDNA for nov 


34 


391. 


.6 


10. 


.5 


1683 


22 


AAD08386 


Human secreted pro 


35 


375 . 


.6 


10. 


. 0 


422 


25 


ABX43312 


Bovine EST associa 


36 


374 


10. 


. 0 


422 


25 


ABX4 64 02 


Bovine EST associa 


37 


323 


8 . 


.6 


460 


20 


AAV87027 


EST clone BG160. 


38 


322. 


, 4 


8 . 


.6 


389 


25 


ABX39989 


Bovine EST associa 


39 


316. 


,2 


8. 


, 5 


615 


22 


AAK93939 


Human cDNA clone r 


40 


302 . 


,6 


8. 


. 1 


423 


25 


ABX43927 


Bovine EST associa 


41 


299 


8. 


.0 


562 


22 


AAK93574 


Human cDNA clone r 


42 


278 . 


,8 


7. 


, 5 


742 


22 


AAI96236 


Human neuroblastom 


43 


266. 


4 


7 . 


, 1 


668 


24 


ABL89601 


Human polynucleoti 


44 


258 . 


2 


6. 


9 


495 


22 


AAK92091 


Human cDNA 5 ' -end 


45 


253 . 


2 


6. 


,8 


3202 


19 


AAX75770 


Human neuroendocri 



ALIGNMENTS 



RESULT 1 
AAD01173 

ID AAD01173 Standard; CDNA; 4684 BP. 
XX 

AC AAD01173; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Rat neurite growth inhibitor Nogo A cDNA . 
XX 

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma ; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperprolif erative disorder; benign dysprol iterative disorder; diagnosis; 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 

KW structural plasticity; screening; ss. 

XX 

OS Rattus sp. 
XX 

FH Key Locat ion/Qualif iers 

FT CDS 253.. 3744 

FT /*tag= a 

FT /product^ "Nogo A" 

FT /transl_except= (pos : 1462 . . 1464 , aarlle) 

XX 

PN WO200031235-A2 . 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 9 9WO-US2 6 160 . 
XX 

PR 06-NOV-1998; 98US-0107446 . 
XX 

PA (SCHW/) SCHWAB M E. 
PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
DR P-PSDB; AAY71310. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic 

PT disorders of the central nervous system and inducing regeneration of 

PT neurons - 

XX 

PS Claim 26; Fig 2A; 122pp; English. 
XX 

CC The present sequence is a cDNA encoding rat Nogo A protein which is a 
CC potent neural cell growth inhibitor and is free of all central nervous 
CC system (CNS) myelin material with which it is natively associated. 
CC The present sequence was generated by fusing R018U37-3, R1-3U21 cDNA 
CC sequences isolated from hexanucleot ides -primed rat brain stem/spinal cord 
CC library, and 01il8 cDNA from an oligo d(T) -primed rat oligodendrocyte 
CC library. Nogo proteins and fragments displaying neurite growth inhibitory 
CC activity are used in the treatment of neoplastic disease of the CNS 



CC e.g. glioma, glioblastoma, medulloblastoma , craniopharyngioma, ependyoma, 

CC pinealoma, haemangioblastoma , acoustic neuroma, oligodendroglioma, 

CC menagioma, neuroblastoma or retinoblastoma and degenerative nerve 

CC diseases e.g. Alzheimer's and Parkinson's diseases. Therapeutics which 

CC promote Nogo activity can be used to treat or prevent hyperprol iterative 

CC or benign dysprol iterative disorders e.g. psoriasis and tissue 

CC hypertrophy. Ribozymes or antisense Nogo nucleic acids can be used to 

CC inhibit production of Nogo protein to induce regeneration of neurons or 

CC to promote structural plasticity of the CNS in disorders where neurite 

CC growth, regeneration or maintenance are deficient or desired. 

CC The animal models can be used in diagnostic and screening methods for 

CC predisposition to disorders and to screen for or test molecules which 

CC can treat or prevent disorders or diseases of the CNS. 

CC Note: SEQ ID numbers 35-42 are referred in claim 32 and SEQ ID NO: 29 

CC in disclosure of the specification. However the specification does not 

CC include sequences for these SEQ ID numbers. 
XX 

SQ Sequence 4684 BP; 1358 A; 1048 C; 1112 G; 1166 T; 0 Other; 

Query Match 100.0%; Score 3739.4; DB 21; Length 4684; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 3 740; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy 1 ATTGCTCGTCTGGGCGGCGGCGGCGGCTGCAGCCTGGGACAGGGCGGGTGGCACATCTCG 60 

II NNNNINN NINNN ININ IINIIIIINI ININ III 

Db 1 ATTGCTCGTCTGGGCGGCGGCGGCGGCTGCAGCCTGGGACAGGGCGGGTGGCACATCTCG 60 

Qy 61 ATCGCGAAGGCAGCAGAAGCAGTCTCATTGTTCCGGGAGCCGTCGCCTCTGCAGGTTCTT 12 0 

MINIMI M MINIMI II MINII III IM II MUM MM 

Db 61 ATCG CGAAGG CAGGAGAAG CAGTCT CATTGTT C CGGGAG C CGT CG CCT CTG CAGGTTCTT 120 

Qy 121 CGGCTCGGCTCGGCACGACTCGGCCTGCCTGGCCCCTGCCAGTCTTGCCCAACCCCCACA 180 

II MINI MINI IIIMIIN NUN INNNNNI llllll III 

Db 121 CGGCTCGGCTCGGCACGACTCGGCCTGCCTGGCCCCTGCCAGTCTTGCCCAACCCCCACA 180 

Qy 181 ACCGCCCGCGACTCTGAGGAGAAGCGGCCCTGCGGCGGCTGTAGCTGCAGCATCGTCGGC 24 0 

' 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ; I 1 1 : 1 1 1 1 1 1 : 1 1 I I 

Db 181 ACCGCCCGCGACTCTGAGGAGAAGCGGCCCTGCGGCGGCTGTAGCTGCAGCATCGTCGGC 24 0 

Qy 241 GACCCGCCAGCCATGGAAGACATAGACCAGTCGTCGCTGGTCTCCTCGTCCACGGACAGC 3 00 

i I M I I I I I I I I I J 1 : I I i i I I : ' I : I I , I I I I I I ! I I ! I I , I I : II! Ml I 

Db 241 GACCCGCQ^GCCATGGAAGACATAGACCAGTCGTCGCTGGTCTCCTCGTCCACGGACAGC 3 00 

Qy 3 01 CCGCCCCGGCCTCCGCCCGCCTTCAAGTACCAGTTCGTGACGGAGCCCGAGGACGAGGAG 3 60 

I NNN llllll. INIINII ININ II NINI llllll ININ 

Db 3 01 CCGCCCCGGCCTCCGCCCGCCTTCAAGTACCAGTTCGTGACGGAGCCCGAGGACGAGGAG 3 60 

Qy 3 61 GACGAGGAGGAGGAGGAGGACGAGGAGGAGGACGACGAGGACCTAGAGGAACTGGAGGTG 420 

IIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMI 

Db 361 GACGAGGAGGAGGAGGAGGACGAGGAGGAGGACGACGAGGACCTAGAGGAACTGGAGGTG 420 

Qy 421 CTGGAGAGGAAGCCCGCAGCCGGGCTGTCCGCAGCTGCGGTGCCGCCCGCCGCCGCCGCG 480 

IIMIIIMIIIMMIIIMIIIIIIIIIIIIMIIIIMIIIIIIIIIIMIIIMII 

Db 421 CTGGAGAGGAAGCCCGCAGCCGGGCTGTCCGCAGCTGCGGTGCCGCCCGCCGCCGCCGCG 4 80 

Qy 481 CCGCTGCTGGACTTCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGCCGGCC 54 0 

IMIMMMIIIIIIlllllll IIMII IMII III Mill IIIMMIIM MINI 



Db 



4 81 CCGCTGCTGGACTTCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGCCGGCC 54 0 



Qy 541 GCGCCCCCTGCCGCTCCTGAGAGGCAGCCATCCTGGGAACGCAGCCCCGCGGCGCCCGCG 600 

ill! 1 1 1 1 1 1 : 1 1 1 1 1 1 1 : 1 1 : 1 M II 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 i i 1 1 1 1 1 1 ! 

Db 541 GCGCCCCCTGCCGCTCCTGAGAGGCAGCCATCCTGGGAACGCAGCCCCGCGGCGCCCGCG 600 

Qy 601 CCATCCCTGCCGCCCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAGAGGACGACGAGCCT 660 

IMIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIMIIIIIIIMIIIIMIIIIMI 

Db 601 CCATCCCTGCCGCCCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAGAGGACGACGAGCCT 660 

Qy 661 CCGGCGAGGCCCCCGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGGCGGAGCCCGCCGCG 72 0 

Ml MMM MMMMIMM II IMMI IMIIIMI llllll Mill 

Db 661 CCGGCGAGGCCCCCGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGGCGGAGCCCGCCGCG 720 

Qy 721 CCCCCTTCCACGCCGGCCGCGCCCAAGCGCAGGGGCTCCGGCTCAGTGGATGAGACCCTT 78 0 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIMIIIIIIIIIII 

Db 721 CCCCCTTCCACGCCGGCCGCGCCCAAGCGCAGGGGCTCCGGCTCAGTGGATGAGACCCTT 78 0 

Qy 781 TTTGCTCTTCCTGCTGCATCTGAGCCTGTGATACCCTCCTCTGCAGAAAAAATTATGGAT 84 0 

MMIM IMMI MUM Ml I llllll II llllll II !M II MM 

Db 781 TTTGCTCTTCCTGCTGCATCTGAGCCTGTGATACCCTCCTCTGCAGAAAAAATTATGGAT 84 0 

Qy 841 TTGATGGAGCAGCCAGGTAACACTGTTTCGTCTGGTCAAGAGGATTTCCCATCTGTCCTG 900 

Ml llllll IIIIIMII IMMI MMM M IMMIMI IMMI Ml 

Db 841 TTGATGGAGCAGCCAGGTAACACTGTTTCGTCTGGTCAAGAGGATTTCCCATCTGTCCTG 900 

Qy 901 CTTGAAACTGCTGCCTCTCTTCCTTCTCTATCTCCTCTCTCAACTGTTTCTTTTAAAGAA 960 

Ml IMMI MMIMMMIMM IMMI IMMIMI II IMMI Ml 

Db 901 CTTGAAACTGCTGCCTCTCTTCCTTCTCTATCTCCTCTCTCAACTGTTTCTTTTAAAGAA 96 0 

Qy 961 CATGGATACCTTGGTAACTTATCAGCAGTGTCATCCTCAGAAGGAACAATTGAAGAAACT 102 0 

Ml MMM MMMMMMIMI IMMMMMIIM IMMIMI MM 

Db 961 CATGGATACCTTGGTAACTTATCAGCAGTGTCATCCTCAGAAGGAACAATTGAAGAAACT 1020 

Qy 1021 TTAAATGAAG CTTCTAAAGAGTTG CCAGAGAGGG CAACAAATC CATTTGTAAATAGAGAT 108 0 

Ml llllll MMMMMMIMI IMMI IMMIMI IMMIMI III 

Db 1021 TTAAATGAAG CTTCTAAAGAGTTG CCAGAGAGGG CAACAAAT C CATTTGTAAATAGAGAT 1080 

Qy 1081 TTAG CAGAATTTTCAGAATTAGAATATTCAGAAATGGGATCAT CTTTTAAAGG CT CCCCA 114 0 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIMIIIIIIIIIIMI 

Db 1081 TTAGCAGAATTTTCAGAATTAGAATATTCAGAAATGGGATCATCTTTTAAAGGCTCCCCA 114 0 

Qy 1141 AAAGGAGAGTCAGCCATATTAGTAGAAAACACTAAGGAAGAAGTAATTGTGAGGAGTAAA 12 00 

MMIMMMIMM IMMI MMM II IMMIMI III II Ml 

Db 1141 AAAGGAGAGTCAGCCATATTAGTAGAAAACACTAAGGAAGAAGTAATTGTGAGGAGTAAA 12 00 

Qy 12 01 GACAAAGAGGATTTAGTTTGTAGTGCAGCCCTTCACAGTC^ 1260 

IIIIMIII II III llllll IIIIIIMMIIII lllllillllll! MINIMUM 

Db 1201 GACAAAGAGGATTTAGTTTGTAGTGCAGCCCTTCACAGTCCACAAGAATCACCTGTGGGT 1260 

Qy 1261 AAAGAAGACAGAGTTGTGTCTCCAGAAAAGACAATGGACATTTTTAATGAAATGCAGATG 132 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 11 ! 1 1 1 1 1 i 1 1 1 1 1 1 1 

Db 12 61 AAAGAAGACAGAGTTGTGTCTCCAGAAAAGAC^TGGAO^TTTTTAATGAAATGCAGATG 1320 

Qy 1321 TCAGTAGTAGCACCTGTGAGGGAAGAGTATGCAGACTTTAAGCCATTTGAACAAGC^TGG 138 0 

1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 1 M 1 1 M I II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1321 TCAGTAGTAGCACCTGTGAGGGAAGAGTATGCAGACTTTAAGCCATTTGAACAAGCATGG 1380 



Qy 13 81 GAAGTGAAAGATACTTATGAGGGAAGTAGGGATGTG CTGG CTG CTAGAG CTAATGTGGAA 144 0 

II MMM llllll M MMM MMM MMIIIM llllll IMIM I 

Db 1381 GAAGTGAAAGATACTTATGAGGGAAGTAGGGATGTGCTGG CTG CTAGAG CTAATGTGGAA 144 0 

Qy 1441 AGTAAAGTGGACAGAAAATGCTTGGAAGATAGCCTGGAGCAAAAAAGTCTTGGGAAGGAT 1500 

I llllll lllilllll llllll IMIIMIIIIIMII llllll llllli I 

Db 1441 AGTAAAGTGGACAGAAAATG CTTGGAAGATAGC CTGGAG CAAAAAAGTCTTGGGAAGGAT 150 0 

Qy 15 01 AGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAACCTGTGAAGGACAGC 156 0 

I I I I I I I II I I II I I I I M I I I I I I II I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I 

Db 1501 AGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAACCTGTGAAGGACAGC 1560 

Qy 1561 TCCAGAGCATATATTACCTGTGCTTCCTTTACCTCAGCAACCGAAAGCACCACAGCAAAC 162 0 

I IIIIIMM MMM MMMI IMIII II IMMIMI llllll llllll 

Db 1561 TCCAGAGC^TATATTACCTGTGCTTCCTTTACCT^^ 162 0 

Qy 1621 ACTTTCCCTTTGTTAGAAGATCATACTTCAGAAAATAAAACAGATGAAAAAAAAATAGAA 168 0 

I M IMIII IMIM IMIM IMMMMMIMM IMMIIM Mill 

Db 1621 ACTTTCCCTTTGTTAGAAGATCATACTTCAGAAAATAAAACAGATGAAAAAAAAATAGAA 168 0 

Qy 1681 GAAAGGAAGG CC CAAATTATAACAGAGAAGACTAG CCCCAAAACGT CAAATCCTTT CCTT 174 0 

MMI MMM Mill Ml MM II IMIM IIIIIIMI MMM M II! 

Db 1681 GAAAGGAAGGCCCAAATTATAACAGAGAAGACTAGCCCCAAAACGTCAAATCCTTTCCTT 174 0 

Qy 1741 GTAGCAGTACAGGATTCTGAGGCAGATTATGTTACAACA^ 18 00 

Mill MMM IMIMIM llllll MMM MMMIMMIMM II li 

Db 1741 GTAG(^GTACAGGATTCTGAGGCAGATTATGTTACAA(^a 18 00 

Qy 1801 GAGG(^G(^GTGTCAAAC^TGCCTGAAGGTCTGACGC(^GATTTAGTTCAGGAAGCATGT 1860 

MM llllll II MMM MMMI IMIM II MMM llllll M III 

Db 18 01 GAGGCAGCAGTGTCAAACATGCCTGAAGGTCTGACGCCAGATTTAGTTCAGGAAGCATGT 18 6 0 

Qy 1861 GAAAGTGAACTGAATGAAGCCACAGGTACAAAGATTGCTT^ 192 0 

MM llllll II MMM :MMI IMIM M llllll IMMIMI IM 

Db 18 61 GAAAGTGAACTGAATGAAGCCACAGGTACAAAGATTGCTTATGAAACAAAAGTGGACTTG 192 0 

Qy 1921 GTCCAAACATCAGAAGCTATACAAGAATCACTTTACCCC^ 198 0 

MM MMM ll IMIII MMM IMIM II IMIM II MMM IM 

Db 1921 GTCCAAACATCAGAAGCTATACAAGAATCACTTTACCCCA 198 0 

Qy 1981 TTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATGGAAGCACCA 2 04 0 

IMIMIMM II IMIM MMM llllll II llllll II IMIM IM 

Db 1981 TTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATGGAAGCACCA 204 0 

Qy 2 041 TTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTATCCCCACTG 2100 

IIIIMIIMIMI IMIII llllll IIIIIIMI IMMIMI Mill! II 

Db 2 041 TTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTATCCCCACTG 2100 

Qy 2101 GAAGCACCTC CTC CAGTTAGTTATGACAGTATAAAG CTTGAG C CTGAAAAC CC C CCACCA 2160 

MM llllll II IMIII MMMI llllll II IMIII IMMIMI IM 

Db 2101 GAAGCACCTCCTC CAGTTAGTTATGACAGTATAAAG CTTGAG C CTGAAAAC CCC CCACCA 2160 

Qy 2161 TATGAAGAAG CCATGAATGTAG CACTAAAAGCTTTGGGAACAAAGGAAGGAATAAAAGAG 222 0 

Mill MMM IMMMIMMMM llllll MMIIIM llllll IMIM 

Db 2161 TATGAAGAAG C CATGAATGTAG CACTAAAAGCTTTGGGAACAAAGGAAGGAATAAAAGAG 222 0 



Qy 2221 CCTGAAAGTTTTAATGCAGCTGTTCAGGAAACAGAAGCTCCTTATATATCCATTGCGTGT 2280 

MM lllllllll llllll IMMIMI MMIIMI MIMMMMM II 

Db 2221 C CTGAAAGTTTTAATG CAG CTGTTCAGGAAACAGAAG CT CCTTATATAT CCATTG CGTGT 228 0 

Qy 2281 GATTTAATTAAAGAAACAAAGCTCTCCACTGAGCCAAGTCCAGATTTCTCTAATTATTCA 234 0 

IIIMIMIMMIMIIIMIIIIIIIMIIIIIIIIIMMIIIIIMIMIIIIMI 

Db 22 81 GATTTAATTAAAGAAACAAAGCTCTCCACTGAGCCAAGTCCAGATTTCTCTAATTATTCA 234 0 

Qy 2341 GAAATAGCAAAATTCGAGAAGTCGGTGCCCGAACACGCTGAGCTAGTGGAGGATTCCTCA 24 0 0 

MMMMMMMMIMM II MMII llllll II MMII llllll III 

Db 2341 GAAATAGCAAAATTCGAGAAGTCGGTGCCCGAACACGCTGAGCTAGTGGAGGATTCCTCA 24 0 0 

Qy 24 01 CCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAAGTCCCACAAACA 24 6 0 

MIMMMMMMMMI MM llllll llllll ll MMII llllll II 

Db 24 01 CCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAAGTCCCACAAACA 2460 

Qy 2461 CAAGAGGAGGCTGTGATGCTCATGAAGGAGAGTCTCACTGAAGTGTCTGAGACAGTAGCC 252 0 

MMIM MMM II III M IMMMMMM II IMMMMMIMI 

Db 24 61 CAAGAGGAGGCTGTGATGCTCATGAAGGAGAGTCTCACTGAAGTGTCTGAGACAGTAGCC 252 0 

Qy 2 521 CAG CACAAAGAGGAGAGACTTAGTG CCTCACCTCAGGAG CTAGGAAAGCCATATTTAGAG 258 0 

MMIM llllll II III MlMIMI MMIIMI MMMMMMIMM 

Db 2521 CAG CACAAAGAGGAGAGACTTAGTG CCT CACCT CAGGAG CTAGGAAAGC CATATTTAGAG 258 0 

Qy 2581 TCTTTTCAGCCCAATTTACATAGTACAAAAGATGCT 264 0 

IMMMMMM M !M:M :MIM M MMM MMMMMMIMM 

Db 2 581 TCTTTTCAGCCCAATTTACATAGTACAAAAGATGCTC 264 0 

Qy 2 641 ACCAAAAAGGAGAAAATTTCTTTGCAAATGGAAGAGTTTAATACTGCAAT 2700 

! MMM MIMI II MMII llllll II llllll MMMMMIMM 

Db 2641 ACCAAAAAGGAGAAAATTTCTTTGCAAATGGAAGAGTTTAATACTGCAATTTATTCAAAT 270 0 

Qy 27 01 GATGACTTACTTTCTTCTAAGGAAGAO\AAATAAAAGAAAGTGAAA(^TTTTCAGATTCA 276 0 

IMMMMMM M MMII MMII II MMM MMMMMMIMM 

Db 27 01 GATGACTTACTTTCTTCTAAGGAAGACAAAATAAAAGAAAGTGAAACATTTTCAGATTCA 2760 

Qy 2 761 TCTCCGATTGAGATAATAGATGAATTTCCCACGTTTGTCAGTGCTAAAGATGATTCTCCT 282 0 

MMIM llllll II III II MMM IMIIIMI llllll MIMMII 

Db 2 761 TCTCCGATTGAGATAATAGATGAATTTCCCACGTTTGTCAGTGCTAAAGATGATTCTCCT 282 0 

Qy 2 821 AAATTAGCCAAGGAGTACACTGATCTAGAAGTATCCGACAAAAGTGAAATTGCTAATATC 28 8 0 

llllllll llllll II III M IMMl llllll IMMIMI illllllll 

Db 2 821 AAATTAGCCAAGGAGTACACTGATCTAGAAGTATCCGACAAAAGTGAAATTGCTAATATC 28 8 0 

Qy 2881 CAAAGCGGGGCAGATTCATTGCCTTGCTTAGAATTGCCCTGTGACCTTTCTTTCAAGAAT 294 0 

III Ml I MIMMMMM II MMII llllll IMIIIMI MMIIMI 

Db 2881 CAAAGCGGGGCAGATTCATTGCCTTGCTTAGAATTGCCCTGTGACCTTTCTTTCAAGAAT 2 94 0 

Qy 2 941 ATATATCCTAAAGATGAAGTACATGTTTCAGATGAATTCTCCGAAAATAGGTCCAGTGTA 3000 

IMIMIIII IIIMIIIMMEIIIMMIIMI IIIIIMIIIMIIIIII IIIIMI 

Db 2 941 ATATATCCTAAAGATGAAGTACATGTTTCAGATGAATTCTCCGAAAATAGGTCCAGTGTA 300 0 

Qy 3 001 TCTAAGGCATCCATATCGCCTTCAAATGTCTCTGCTTTGGAACCTC^GAC^GAAATGGGC 3060 

Mill lllllllll MMM MMIMMMIMM IMIIIMI llllll II 

Db 3 001 TCTAAGGCATCCATATCGCCTTCAAATGTCTCTGCTTTGGAACCTCAGACAGAAATGGGC 3060 



Qy 



3 061 AGCATAGTTAAATCC^ATCACTTACGAAAGAAGCAGAGAAAAAACTTCCTTCTGACACA 312 0 



IIMII II III III Mill llllll IIIIIIIIIMIIIIIIMIIIIMI lllllllll 

Db 3061 AGCATAGTTAAATCCAAATCAGTTACGAAAGAAGCAGAGAA 312 0 

Qy 3121 GAGAAAGAGGACAGATCC CTGTCAG CTGTATTGT CAG CAGAG CTGAGTAAAACTTCAGTT 318 0 

1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 S 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 

Db 3121 GAGAAAGAGGACAGATCCCTGTCAGCTGTATTGTCAGCAGAGCTGAGTAAAACTTCAGTT 3180 

Qy 3181 GTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTA 324 0 

1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 ! 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 

Db 3181 GTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTA 324 0 

Qy 3241 TTCCTGCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACATTGCCTTG 3300 

III! MIIUM llll'l lillll IMIII II MINI M Mill! Ill 

Db 3241 TTCCTGCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACATTGCCTTG 33 00 

Qy 3301 GCCCTGCTCTCGGTGACTATCAGCTTTAGGATATATAAGGGCGTGATCCAGGCTATCCAG 3360 

IIIIIIIIMIIMIIIIIIIIIIMIIIIIIMIIIIMIIIIIIIIIIIMMMIII 

Db 3301 GCCCTGCTCTCGGTGACTATCAGCTTTAGGATATATAAGGGCGTGATCCAGGCTATCCAG 3360 

Qy 3361 AAATCAGATGAAGGCCACCCATTCAGGGCATATTTAGAATCTGAAGTTGCTATATCAGAG 342 0 

III! Mill:::! Mill! llllll MINI II llhll II hlMI II 

Db 3361 AAATCAGATGAAGG CCACC CATTCAGGG CATATTTAGAAT CTGAAGTTG CTATATCAGAG 342 0 

Qy 3421 GAATTGGTTCAGAAATACAGTAATTCTGCTCT^ 34 8 0 

INI IMII li. llllll llllll IIIIM II ililllillMIIIII II 

Db 3421 GAATTGGTTCAGAAATACAGTAATTCTGCTCTTGGTCATGTGAACAGCACAATAAAAGAA 348 0 

Qy 34 81 CTGAGGCGGCTTTTCTTAGTTGATGATTTAGTTGATTCCCTGAAGTTTGCAGTGTTGATG 354 0 

M M M M I M M M M M M M M I M M M M I M I M M 1 1 1 ! M M I M M M M I 

Db 3481 CTGAGGCGGCTTTTCTTAGTTGATGATTTAGTTGATTCCCTGAAGTTTGCAGTGTTGATG 354 0 

Qy 3 541 TGGGTGTTTACTTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTGATTTTAGCTCTG 3600 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 

Db 3541 TGGGTGTTTACTTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTGATTTTAGCTCTG 3600 

Qy 3601 ATCTCACTCTTCAGTATTCCTGTTATTTATGAACGGCATCAGGTGCAGATAGATCATTAT 3660 

I I I I I I II I II II I II I II I I I M I II II M II I II II I I I I M II II I M I I II II II I 

Db 3601 ATCTCACTCTTCAGTATTCCTGTTATTTATGAACGGCATCAGGTGCAGATAGATCATTAT 366 0 

Qy 3661 CTAGGACTTGCAAACAAGAGTGTTAAGGATGCCATC 372 0 

Mill llllll II mIIII IIIIIMllMI II llllll i: HUM II 

Db 3661 CTAGGACTTGCAAACAAGAGTGTTAAGGATGCCATGGCCAAAATCCAAGCAAAAATCCCT 372 0 

Qy 3721 GGATTGAAG CG CAAAG CAGAT 3741 

Mlllllllllllllllllll 
Db 3721 GGATTGAAGCGCAAAGCAGAT 3741 



RESULT 2 
ABN86600 

ID ABN86600 standard; DNA; 4684 BP. 
XX 

AC ABN86600; 
XX 

DT 05-NOV-2002 (first entry) 
XX 

DE Rat neurotransmitter receptor protein Nogo encoding DNA . 



XX 

KW Nerve regeneration; neuroprotection; neuronal degeneration; CNS; PNS; 

KW central nervous system; peripheral nervous system; tranquillizer; Nogo; 

KW vulnerary; cerebroprotective ; ant i- tumour; antidiabetic; anticonvulsant; 

KW nootropic; antiparkinsonian; ophthalmological ; analgesic; hepatotropic ; 

KW osteopathic; vasotropic; nephrotropic ; cytostatic; antigen; gene therapy; 

KW neurotransmitter receptor; rat; gene; ds . 
XX 

OS Rattus norvegicus . 
XX 

FH Key Location/Qualifiers 

FT CDS 253 . .3744 

FT /*tag= a 

FT /product= "Nogo -A" 

XX 

PN US2002072493-A1 . 
XX 

PD 13-JUN-2002. 
XX 

PF 28-JUN-2001; 2001US-0893348 . 
XX 

PR 19-MAY-1998 ; 98IL-0124500 . 

PR 21-JUL-1998; 98WO-US14715 . 

PR 22-DEC-1998; 98US-0218277 . 

PR 19-MAY-1999; 99US-0314161 . 
XX 

PA (YEDA ) YEDA RES & DEV CO LTD . 
XX 

PI Eisenbach-Schwartz M, Hauben E, Cohen IR, Beserman P, Mosonego A; 

PI Moalem G; 

XX 

DR WPI; 2002-607255/65. 

DR P-PSDB; ABB81074, ABB81076, ABB81077. 
XX 

PT Promoting nerve regeneration and preventing neuronal degeneration in 

PT the central/peripheral nervous system from injury/disease, comprises 

PT administering nervous system-specific activated T cells/antigen, or 

PT analogs/peptides 
XX 

PS Disclosure; Page 40-44; 93pp; English. 
XX 

CC The invention relates to promoting nerve regeneration or conferring 

CC neuroprotection and preventing or inhibiting neuronal degeneration in the 

CC central /peripheral nervous system (NS) . The method involves administering 

CC NS-specific activated T cells, NS-specific antigen, its analogue or its 

CC peptide, a nucleotide sequence the NS-specific antigen or its analogue or 

CC combinations. The method is useful for promoting nerve regeneration and 

CC preventing neuronal degeneration in central /peripheral nervous system 

CC from injury/disease, where the injury is spinal cord injury, blunt 

CC trauma, penetrating trauma, hemorrhagic stroke, ischaemic stroke or 

CC damages caused by surgery such as tumour excision. The disease is not an 

CC autoimmune disease or neoplasm. The disease results in a degenerative 

CC process occurring in either gray or white matter or both. The disease 

CC is diabetic neuropathy, senile dementia, Alzheimer's disease, Parkinson's 

CC disease, facial nerve (Bell's) palsy, glaucoma, Huntington's chorea, 

CC amyotrophic lateral sclerosis, non-arteritic optic neuropathy, and 

CC vitamin deficiency, intervertebral disc herniation, prion diseases such 



CC as Creutzf eldt- Jakob disease, carpal tunnel syndrome, peripheral 

CC neuropathies associated with various diseases, including but not limited 

CC to uremia, porphyria, hypoglycemia, Sjorgren Larsson syndrome, acute 

CC sensory neuropathy, chronic ataxic neuropathy, biliary cirrhosis, primary 

CC amyloidosis, obstructive lung diseases, acromegaly, malabsorption 

CC syndromes, polycythemia vera, immunoglobulin (Ig)A- and IgG gamma - 

CC pathies, complications of various drugs (e.g., metronidazole) and toxins 

CC (e.g., alcohol or organophosphates) , Charcot -Marie-Tooth disease, ataxia 

CC telangectasia , Friedreich's ataxia, amyloid polyneuropathies, 

CC adrenomyeloneuropathy , Giant axonal neuropathy, Ref sum's disease, Fabry's 

CC disease, or lipoproteinemia . The present sequence represents a DNA 

CC encoding the rat neurotransmitter receptor protein Nogo (Nogo-A, Nogo-B 

CC and Nogo-C) , an example of NS-specific antigen. 

XX 

SQ Sequence 4684 BP; 1358 A; 1047 C; 1112 G; 1167 T; 0 other; 

Query Match 100.0%; Score 3739.4; DB 24; Length 4684; 

Best Local Similarity 100.0%; Pred. No . 0 ; 

Matches 3740; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy 1 ATTGCTCGTCTGGGCGGCGGCGGCGGCTGCAGCCTGGGACAGGGCGGGTGGCACATCTCG 60 

IMMMIMMM MUM MUM IMMMM II MUM IMMIMII 

Db 1 ATTGCTCGTCTGGGCGGCGGCGGCGGCTGCAGCCTGGGACAGGGCGGGTGGCACATCTCG 60 

Qy 61 ATCGCGAAGGCAGCAGAAGCAGTCTCATTGTTCCGGGAGCCGTCGCCTCTGCAGGTTCTT 12 0 

Illllllllllll llllll IIIIIMI IMIII II MINI 11:111 III 

Db 61 ATCGCGAAGGCAGGAGAAGCAGTCTCATTGTTCCGGGAGCCGTCGCCTCTGCAGGTTCTT 12 0 

Qy 121 CGGCTCGGCTCGGCACGACTCGGCCTGCCTGGCCCCTGCCAGTCTTGCCCAACCCCCACA 18 0 

MM IMMMM MUM llllll II IMMMM MMM II III II 

Db 121 CGGCTCGGCTCGGCACGACTCGGCCTGCCTGGCCCCTGCCAGTCTTGCCCAACCCCCACA 180 

Qy 181 ACCGCCCGCGACTCTGAGGAGAAGCGGCCCTGCGGCGGCTGTAGCTGCAGCATCGTCGGC 24 0 

MM IMMMM MMM MMM II IMMMM MMM II III IM 

Db 181 ACCGCCCGCGACTCTGAGGAGAAGCGGCCCTGCGGCGGCTGTAGCTGCAGCATCGTCGGC 24 0 

Qy 241 GACCCGCCAGCCATGGAAGACATAGACCAGTCGTCGCTGGTCTCCTCGTCCACGGACAGC 3 00 

MM IIMMIII MIMMMMM M MMM II MMM MMM II! 

Db 241 GACCCGCCAGCCATGGAAGACATAGACCAGTCGTCGCTGGTCTCCTCGTCCACGGACAGC 3 00 

Qy 301 CCGCCCCGGCCTCCGCCCGCCTTCAAGTACCAGTTCGTGACGGAGCCCGAGGACGAGGAG 360 

IIIIMIIIIIIIMIMIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIII 

Db 301 CCGCCCCGGCCTCCGCCCGCCTTCAAGTACCAGTTCGTGACGGAGCCCGAGGACGAGGAG 360 

Qy 361 GACGAGGAGGAGGAGGAGGACGAGGAGGAGGACGACGAGGACCTAGAGGAACTGGAGGTG 42 0 

MM MMM II MMM llllll MMMMMMMIMM llllll II 

Db 361 GACGAGGAGGAGGAGGAGGACGAGGAGGAGGACGACGAGGACCTAGAGGAACTGGAGGTG 42 0 

Qy 421 CTGGAGAGGAAGCCCGCAGCCGGGCTGTCCGCAGCTGCGGTGCCGCCCGCCGCCGCCGCG 48 0 

MM MIMMM MMMMIMM IMMIIM llllll M MMMMM 

Db 421 CTGGAGAGGAAGCCCGCAGCCGGGCTGTCCGCAGCTGCGGTGCCGCCCGCCGCCGCCGCG 48 0 

Qy 481 CCGCTGCTGGACTTCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGCCGGCC 54 0 

■ : ■ 1 : M 

Db 481 CCGCTGCTGGACTTCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGCCGGCC 54 0 



Qy 



541 GCGCCCCCTGCCGCTCCTGAGAGGCAGCCATCCTGGGAACGCAGCCCCGCGGCGCCCGCG 60 0 



Db 541 GCGCCCCCTGGCGCTCCTGAGAGGCAGCCATCCTGGGAACGCAGCCCCGCGGCGCCCGCG 600 

Qy 6 01 CCATCCCTGCCGCCCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAGAGGACGACGAGCCT 660 

III iillll II. IN II Mill! IIIIIIMIilllMIMI llhll ill 

Db 6 01 CCATCCCTGCCGCCCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAGAGGACGACGAGCCT 660 

Qy 661 CCGGCGAGGCCCCCGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGGCGGAGCCCGCCGCG 72 0 

1 1 ! I i 1 1 1 1 1 1 1 1 1 1 - 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 i 1 1 1 1 1 : 1 1 1 : 1 1 : 1 I I I 

Db 661 CCGGCGAGGCCCCCGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGGCGGAGCCCGCCGCG 72 0 

Qy 721 CCCCCTTCCACGCCGGCCGCGCCCAAGCGCAGGGGCTCCGGCTCAGTGGATGAGACCCTT 78 0 

III MMM IMMMI llllll MMMMMMIMI MUM II Ml 

Db 721 CCCCCTTCCACGCCGGCCGCGCCCAAGCGCAGGGGCTCCGGCTCAGTGGATGAGACCCTT 78 0 

Qy 781 TTTGCTCTTCCTGCTGCATCTGAGCCTGTGATACCCTCCTCTGCAGAAAAAATTATGGAT 84 0 

Ml MMM II IM II MMM MMMMMMIMI M III II Ml 

Db 781 TTTGCTCTTCCTGCTGCATCTGAGCCTGTGATACCCTCCTCTGCAGAAAAAATTATGGAT 84 0 

Qy 841 TTGATGGAGCAGCCAGGTAACACTGTTTCGTCTGGTCAAGAGGATTTCCCATCTGTCCTG 900 

III MIMI llllll MMlMII MMMMMMIMI I! Mi II III 

Db 841 TTGATGGAGCAGCCAGGTAACACTGTTTCGTCTGGTCAAGAGGATTTCCCATCTGTCCTG 900 

Qy 901 CTTGAAACTGCTGCCTCTCTTCCTTCTCTATCTCCTCTCTCAACTGTTTCTTTTAAAGAA 960 

limillllllMIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIMIIMI 

Db 901 CTTGAAACTGCTGCCTCTCTTCCTTCTCTATCTCCTCTCTCAACTGTTTCTTTTAAAGAA 960 

Qy 961 CATGGATACCTTGGTAACTTATCAGCAGTGTCATCCTCAGAAGGAACAATTGAAGAAACT 102 0 

MMlMII iMIIMM III II MMMMMMIMI MMM M MM 

Db 961 CATGGATACCTTGGTAACTTATCAGCAGTGTCATCCTCAGAAGGAACAATTGAAGAAACT 102 0 

Qy 1021 TTAAATGAAGCTTCTAAAGAGTTGCCAGAGAGGGCAACAAATCCATTTGTAAATAGAGAT 108 0 

IMMMII MIMi || Ml M |M II MMlMII IMMMIMM 

Db 1021 TTAAATGAAG CTT CTAAAGAGTTG C CAGAGAGGG CAACAAAT C CATTTGTAAATAGAGAT 108 0 

Qy 1081 TTAG CAGAATTTTCAGAATTAGAATATTCAGAAATGGGATCATCTTTTAAAGG CTC CCCA 114 0 

IMIIMIMIIIIIIIIII MINIM MMI I llllll IMIIMMIMI MIMI 

Db 1081 TTAGCAGAATTTTCAGAATTAGAATATT CAGAAATGGGATCATCTTTTAAAGG CT CCCCA 114 0 

Qy 1141 AAAGGAGAGTCAGCCATATTAGTAGAAAACACTAAGGAAGAAGTAATTGTGAGGAGTAAA 12 00 

II IIIIIIIIMIIIIIIIIIIlliillllllllllilll II III Mill IIIIIIIMI 

Db 1141 AAAGGAGAGTCAGCCATATTAGTAGAAAACACTAAGGAAGAAGTAATTGTGAGGAGTAAA 12 00 

Qy 1201 GACAAAGAGGATTTAGTTTGTAGTGCAGCCCTTCACAGTCCACAAGAATCACCTGTGGGT 1260 

1 1 1 1 1 1 1 i I J 1 1 1 1 1 1 ! J 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 i 1 1 1 1 1 1 i 1 1 1 1 1 1 1 

Db 12 01 GACAAAGAGGATTTAGTTTGTAGTGCAGCCCTTCACAGTCCACAAGAATCACCTGTGGGT 1260 

Qy 12 61 AAAGAAGACAGAGTTGTGTCTCCAGAAAAGACAATGGACATTTTTAATGAAATGCAGATG 132 0 

1 1 i f 1 1 1 1 1 1 1 1 i i 1 1 1 1 1 1 1 1 1 1 1 f J M f 1 1 1 ! 1 1 i 1 1 1 1 M I E 1 1 1 1 f 1 1 1 J 1 1 1 1 J I 

Db 1261 AAAGAAGACAGAGTTGTGTCTCCAGAAAAGACAATGGACATTTTTAATGAAATGCAGATG 132 0 

Qy 1321 TCAGTAGTAGCACCTGTGAGGGAAGAGTATGCAGACTTTAAGCCATTTGAACAAGCATGG 13 8 0 

1 1 1 1 ! 1 1 1 I < 1 1 1 1 1 1 1 1 1 1 ; I ! 1 1 1 1 i ! ! 1 . 1 1 1 M I ! 1 1 1 1 i M 1 1 1 II 1 1 . 

Db 1321 TCAGTAGTAGCACCTGTGAGGGAAGAGTATGCAGACTTTAAGCCATTTGAACAAGCATGG 13 8 0 

Qy 1381 GAAGTGAAAGATACTTATGAGGGAAGTAGGGATGTG CTGG CTGCTAGAGCTAATGTGGAA 144 0 
I I I I I I I I I I I I I I I I I I I I ! I I I I I I ! ! I I I I I I I : I I I I I I I I II I I I i I I I I 



Db 



13 81 GAAGTGAAAGATACTTATGAGGGAAGTAGGGATGTG CTGGCTGCTAGAG CTAATGTGGAA 144 0 



Qy 1441 AGTAAAGTGGACAGAAAATG CTTGGAAGATAG CCTGGAGCAAAAAAGTCTTGGGAAGGAT 15 00 

llllll IIIMI IMMMMMMM MMMM MMM IIIMI MM 

Db 1441 AGTAAAGTGGACAGAAAATG CTTGGAAGATAG CCTGGAG CAAAAAAGTCTTGGGAAGGAT 15 00 

Qy 1501 AGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAACCTGTGAAGGACAGC 1560 

MINI! llllll llllll II llllll MINI M llllli IIMII MM 

Db 1501 AGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAACCTGTGAAGGACAGC 1560 

Qy 1561 TCCAGAGCATATATTACCTGTGCTTCCTTTACCTC^GCAACCGAAAGC^CCACAGCAAA^ 1620 

1 1 1 1 1 1 1 1 1 1 1 1 - 1 1 1 1 ! I 1 1 I i 1 1 1 1 . 1 1 1 1 1 1 1 1 1 1 1 MM i I I M 1 1 1 1 1 

Db 1561 TCCAGAGCATATATTACCTGTGCTTCCTTTACCTCA^ 1620 

Qy 1621 ACTTTCCCTTTGTTAGAAGATCATACTTCAGAAAATAAAACAGATGAAAAAAAAATAGAA 1680 

IIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIII 

Db 1621 ACTTTCCCTTTGTTAGAAGATCATACTTCAGAAAATAAAACAGATGAAAAAAAAATAGAA 1680 

Qy 1681 GAAAGGAAGGCCCAAATTATAACAGAGAAGACTAGCCCCAAAACG 174 0 

MMMI MM IIMII IIIIIMMIIIIIMI IM II llllll 

Db 1681 GAAAGGAAGG CCCAAATTATAACAGAGAAGACTAG CCC CAAAACGTCAAATC CTTT CCTT 174 0 

Qy 1741 GTAG CAGTACAGGATTCTGAGG CAGATTATGTTACAACAGATACCTTAT CAAAGGTGACT 18 00 

MM II IMM! llllll MMM II llllll MMMMMIMM! 

Db 1741 GTAG CAGTACAGGATTCTGAGG CAGATTATGTTACAACAGATAC CTTAT CAAAGGTGACT 18 00 

Qy 18 01 GAGGCAGCAGTGTCAAACATGCCTGAAGGTCTGACGCCAGATTTAGTTCAGGAAGCATGT 18 60 

MM IMIIMI MMM MMMM MMM II MM IMIMM 

Db 18 01 GAGGCAGCAGTGTCAAACATGCCTGAAGGTCTGACGCCAGATT^ 18 60 

Qy 18 61 GAAAGTGAACTGAATGAAGCCACAGGTACAAAGATTGCTTATGAAACAA^ 1920 

MMM II MMM MMM MMMIMMMM MMM llllll 

Db 18 61 GAAAGTGAACTGAATGAAGCCACAGGTACAAAGATTGCTTATGAAACAAAAGTGGACTTG 192 0 

Qy 1921 GTCCAAACATCAGAAGCTATACAAGAATCACTTTACCCCACAGCACAGC^ 1980 

MIMI! IIMIIMI IIMII IIIIIMMIIIIIMI MMMMIMI 

Db 1921 GTCCAAACATCAGAAGCTATACAAGAATCACTTTACCC^ 1980 

Qy 1981 TTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATGGAAGCACCA 2 04 0 

MIM MMMM IIMII Mill IMIIIIMI MMMMIMI 

Db 1981 TTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATGGAAGCACCA 2 04 0 

Qy 2 041 TTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTATCCCCACTG 2100 

MIMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIII 

Db 2 041 TTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTATCCCCACTG 2100 

Qy 2101 GAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCTTGAGCCTGAAAACCCCCCACCA 2160 

MM I! MMMMMIMMMMMMMM II llllll IMIMM 

Db 2101 GAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCTTGAGCCTGAAAACCCCCCACCA 2160 

Qy 2161 TATGAAGAAG CCATGAATGTAG CACTAAAAG CTTTGGGAACAAAGGAAGGAATAAAAGAG 2220 

MIMM IIMIMM llllll II MM I MIMMMIMM MMMM 

Db 2161 TATGAAGAAGCCATGAATGTAGCACTAAAAGCTTTGGGAACAAAGGAAGGAATAAAAGAG 222 0 

Qy 2221 CCTGAAAGTTTTAATGCAGCTGTTCAGGAAACAGAAGCTCCTTATATATCCATTGCGTGT 2280 

1 1 1 1 1 - 1 1 i I : I 1 1 1 1 1 1 1 1 : ; 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 > 

Db 2221 CCTGAAAGTTTTAATGCAGCTGTTCAGGAAACAGAAGCTCCTTATATATCCATTGCGTGT 2280 



Qy " 2281 GATTTAATTAAAGAAACAAAG CT CTCCACTGAGCCAAGTC CAGATTT CTCTAATTATT CA 234 0 

IIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIIII 

Db 2281 GATTTAATTAAAGAAACAAAG CTCTCCACTGAGCCAAGTCCAGATTTCTCTAATTATTCA 234 0 

Qy 2341 GAAATAG CAAAATT CGAGAAGTCGGTG CCCGAACACG CTGAG CTAGTGGAGGATTCCTCA 240 0 

I I . I 1 1 I j I ! i I i J I : I ; ! I I ! 1 1 I I 1 1 1 I i 1 1 1 i I I 1 1 II 1 1 I I I I : 1 1 1 1 I I 

Db 2341 GAAATAG CAAAATTCGAGAAGTCGGTG CCCGAACACG CTGAG CTAGTGGAGGATTCCTCA 24 00 

Qy 24 01 CCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAAGTCCCACAAACA 246 0 

IIIIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIIIMIIIIII 

Db 24 01 CCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAAGTCCCACAAACA 2460 

Qy 24 61 CAAGAGGAGG CTGTGATGCT CATGAAGGAGAGTCTCACTGAAGTGTCTGAGACAGTAG CC 252 0 

I MMM MIMMMMIMM MMMMMIIMM II IMMMIMM 

Db 2461 CAAGAGGAGGCTGTGATGCTCATGAAGGAGAGTCTCACTGAAGTGTCTGAGACAGTAGCC 252 0 

Qy 2521 CAGCACAAAGAGGAGAGACTTAGTGCCT(^CCTO^GGAGCTAGGAAAGCCATATTTAGAG 258 0 

I I 1 1 1 1 1 1 i I i ! 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 .Mill 1 1 1 I ^ I > 1 1 1 1 1 1 1 1 1 1 

Db 2521 CAGCACAAAGAGGAGAGACTTAGTGCCTCACCTCAGGAGCTAGGAAAGCCATATTTAGAG 258 0 

Qy 2581 TCTTTTCAGCCCAATTTACATAGTACAAAAGATGCTGCATCTAA 264 0 

1 1 I ! 1 1 i 1 1 1 1 1 1 1 1 1 ! : 1 1 1 ! 1 1 i I MM 1 1 1 1 1 1 1 i I II IIMM 1 1 : 1 1 1 

Db 2 581 TCTTTTCAGCCCAATTTACATAGTACAAAAGATGCT 264 0 

Qy 2641 AC CAAAAAGGAGAAAATTTCTTTG CAAATGGAAGAGTTTAATACTG CAATTTATT CAAAT 270 0 

IMIMIMM MIMI MIMI II MIMI Ml MIMI IIIMIMIMM 

Db 2641 ACCAAAAAGGAGAAAATTT CTTTG CAAATGGAAGAGTTTAATACTG CAATTTATTCAAAT 27 0 0 

Qy 2701 GATGACTTACTTTCTTCTAAGGAAGACAAAATAAAAGAAAGTGAAACATTTTCAGATTCA 2760 

Mill MMMMIMM MMM II MMM II MMMMMMMIMM 

Db 2701 GATGACTTACTTTCTTCTAAGGAAGACAAAATAAAAGAAAGTGAAACATTTTCAGATTCA 2760 

Qy 2761 T CTCCGATTGAGATAATAGATGAATTTCC CACGTTTGTCAGTG CTAAAGATGATTCTC CT 282 0 

MM MMMMIMM MIMI II MMM MMIMMMIMMIMM 

Db 2761 T CT CCGATTGAGATAATAGATGAATTTCC CACGTTTGTCAGTG CTAAAGATGATTCTC CT 282 0 

Qy 2821 AAATTAG CCAAGGAGTACACTGATCTAGAAGTATCCGACAAAAGTGAAATTGCTAATAT C 288 0 

MM MIMI IIMM MIMI II MIMI MMMMMMIMMIMM 

Db 2821 AAATTAG CCAAGGAGTACACTGATCTAGAAGTATCCGACAAAAGTGAAATTGCTAATAT C 2880 

Qy 2881 CAAAGCGGGGCAGATTCATTGCCTTGCTTAGAATTGCCCTGTGACCTTTCTTTCAAGAAT 2 94 0 

MIM MMM IIMM IIMM M IIMM II MMMMMMMIMM 

Db 28 81 CAAAGCGGGGCAGATTCATTGCCTTGCTTAGAATTGCCCTGTGACCTTTCTTTCAAGAAT 2 94 0 

Qy 2 941 ATATAT CCTAAAGATGAAGTACATGTTTCAGATGAATTCT CCGAAAATAGGTCCAGTGTA 3000 

■Mil IIMM IIMM IIMM II IIMM II MMMMMMMIMM 

Db 2 941 ATATATCCTAAAGATGAAGTACATGTTTCAGATGAATTCTCCGAAAATAGGTCCAGTGTA 3000 

Qy 3 001 TCTAAGGCATCCATATCGCCTTCAAATGTCTCTGCTTTGGAACCTCAGACAGAAATGGGC 3 060 

1 1 1 1 1 1 1 1 i 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 IIMM I M M 1 1 1 II I i 1 1 1 1 1 1 1 1 1 

Db 3 001 TCTAAGGCATCCATATCGCCTTCAAATGTCTCTGCTTTGGAACCTCAGACAGAAATGGGC 3 060 

Qy 3 061 AGC^TAGTTAAATCCAAATC^CTTACGAAAGAAGCAGAGAAAAAACTTCCTTCTGACACA 312 0 

IIIMIMIIIIIMIIIIMIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 3 061 AGCATAGTTAAATCCAAATCACTTACGAAAGAAGCAGAGAAAAAACTTCCTTCTGAC^CA 312 0 



Qy 3121 GAGAAAGAGGACAGATCCCTGTCAGCTGTATTGTCAGCAGAGCTGAGTAAAACTTCAGTT 318 0 

MINI IIIIM IIIIIMII llilll lllllhllllllllMIMII I; I 

Db 3121 GAGAAAGAGGACAGATCCCTGTCAGCTGTATTGTCAGCAGAGCTGAGTAAAACTTCAGTT 318 0 

Qy 3181 GTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTA 324 0 

IIIIIIIMIIIIIIIIIMIIIIIIIIIIMIIIIIIMMIIIIIMIIIIIIIIIII 

Db 3181 GTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTA 324 0 

Qy 3241 TTCCTGCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACATTGCCTTG 3300 

MM MIMIMI MIMMMIMMIMM II Mill MMIIMI I 

Db 3241 TTCCTGCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACATTGCCTTG 3300 

Qy 3301 GCCCTGCTCTCGGTGACTATCAGCTTTAGGATATATAAGGGCGTGATCCAGGCTATCCAG 3360 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 II MM I ( I M M I II I ! 1 1 1 1 M 1 1 ; i ! 

Db 33 01 GCCCTGCTCTCGGTGACTATCAGCTTTAGGATATATAAGGGCGTGATCCAGGCTATCCAG 3360 

Qy 3361 AAATCAGATGAAGGCCACCCATTCAGGGCATATTTAGAATCTGAAGTTGCTATATCAGAG 342 0 

MMMMMIMM IMIM MIMI M Mill II MIMI MMIMI 

Db 33 61 AAATCAGATGAAGGCCACCCATTCAGGGCATATTTAGAATCTGAAGTTGCTATATCAGAG 342 0 

Qy 3421 GAATTGGTTC^GAAATAC^GTAATTCTGCTCTTGGTCATGTGAACAGCACAATAAAAGAA 3480 

I MM MMIMI III MM IMMIMMI MIMI IIIMM 

Db 3421 GAATTGGTTCAGAAATACAGTAATTCTG CT CTTGGT CATGTGAACAG CACAATAAAAGAA 34 8 0 

Qy 34 81 CTGAGGCGGCTTTTCTTAGTTGATGATTTAGTTGATTCCCTGAAGTTTGCAGTGTTGATG 354 0 

1 1 I i I 1 1 i I I ! ! 1 1 Mill I 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 i 1 1 1 1 1 1 

Db 34 81 CTGAGGCGGCTTTTCTTAGTTGATGATTTAGTTGATTCCCTGAAGTTTGCAGTGTTGATG 354 0 

Qy 3 541 TGGGTGTTTACTTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTGATTTTAGCTCTG 3600 

1 1 1 1 1 1 1 1 1 1 ^ 1 1 1 1 1 1 1 1 1 1 1 ' 1 1 1 1 II MM 1 1 1 1 M M M 1 1 M I M 1 1 1 

Db 3 541 TGGGTGTTTACTTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTGATTTTAGCTCTG 3600 

Qy 3 601 ATCTCACTCTTCAGTATTCCTGTTATTTATGAACGGCATCAGGTGCAGATAGATCATTAT 3660 

MIMM IMIM II MIMI MIMI MMIMIMMIMM MUM 

Db 3 601 ATCTCAGTCTTCAGTATTCCTGTTATTTATGAACGGCATCAGGTGCAGATAGATCATTAT 3660 

Qy 3 661 CTAGGACTTGCAAACAAGAGTGTTAAGGATGCCATGGCCAAAATCCAAGCAAAAATCCCT 3 72 0 

1 1 1 1 1 1 II llilll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : M M M M I 1 1 1 1 1 1 1 II MM 1 1 

Db 3 661 CTAGGACTTGCAAAC7VAGAGTGTTAAGGATGCCATGGCCAAAATCCAAGCAAAAATCCCT 372 0 

Qy 3 721 GGATTGAAG CG CAAAG CAGAT 3741 

I MM IMMIII 

Db 3721 GGATTGAAG CG CAAAG CAGAT 3741 
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CC acids, vectors expressing them and antibodies against them, to isolate 

CC agents which block nogo receptor mediated axonal growth. The agent is 

CC useful for treating a central nervous system disorder which is a result 

CC of cranial or cerebral trauma, spinal cord injury, stroke or a 

CC demyelinating disease selected from multiple sclerosis, monophasis 

CC demyel ination, encephalomyelitis, multifocal leukoencephalopathy, 

CC panencephalitis, Marchiafava-Bignami disease, pontine myelinolysis, 

CC adrenol eukodys trophy, Pelizaeus-Merzbacher disease, Spongy degeneration, 

CC Alexander's disease, Canavan's disease, metachromatic leukodystrophy, 

CC viral infection and Krabbe's disease. 

XX 
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21; 



Qy 134 C^CGACTCGGCCTGCCTGGCCCCTGCt^GTCTTGCCC^CCCCCACAACCGCCCGCGACT 193 

I II III I I III llllll Mill I : ! 1 1 1 1 1 : 1 1 I i 1 1 1 1 1 1 II 

Db 16 CTCGGCT(^GTCGGCC(^GCCCCTCTC7VGTCCTCCCCAACCCCCACAACCGCCCGCGGCT 75 

Qy 194 CTGAGGAGAAGCGGC - CCTGCGGCGGCTGTAGCTGCAGCATCGTCGGCGACCCGCCAGCC 252 

Mill I II I I IMIMM I 1 1 II 1 1 1 1 II II II I MM MMM 

Db 76 CTGAGACGCGGCCCCGGCGGCGGCGGCAGCAGCTGCAGCATCATC-TCCACCCTCCAGCC 134 

Qy 253 ATGGAAGACATAGACCAGTCGTCGCTGGTCTCCTCGTCCACGGACAGCCCGCCCCGGCCT 312 

Illllll I IMIMM I lllll IMMM MINI IN IMIMM 

Db 135 ATGGAAGACCTGGACCAGTCTCCTCTGGT CTCGTCCTCGGACAGCCCACCCCGGCCG 191 

Qy 313 CCGCCCGCCTTCAAGTACCAGTTCGTGACGGAGCCCGAGGACGAGGAGGACGAGGAGGAG 372 

I MMM llllllllllillllllll 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 : 1 II II III 

Db 192 CAGCCCGCGTTCAAGTACCAGTTCGTGAGGGAGCCCGAGGACGAGGAG GAAGAAGAG 248 

Qy 373 GAGGAGGACGAGGAGGAGGACGACGAGGACCTAGAGGAACTGGAGGTGCTGGAGAGGAAG 432 

IIMII lllllill II lllll Mill lllll MINI MINIM II 

Db 24 9 GAGGAGGAAGAGGAGGACGAGGACGAAGACCTGGAGGAGCTGGAGGTGCTGGAGAGGAAG 3 08 

Qy 433 CCCGCAGCCGGGCTGTCCGCAGCTGCGGTGC CGCCCGCCGCCGCCGCGCCGCTG 486 

Mill I MMM Ml II I MM I II MMM llllll Ml 

Db 309 CCCGCCGCCGGGCTGTCCGCGGCCCCAGTGCCCACCGCCCCTGCCGCCGGCGCGCCCCTG 368 

Qy 4 87 CTGGACTTCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGCCGGCCGCGCCC 546 

lllllill I I MM lllll II IMIMM II II lllllllllll III 

Db 36 9 ATGGACTTCGGAAATGACTTCGTGCCGCCGGCGCCCCGGGGACCCCTGCCGGCCGCTCCC 428 

Qy 547 CCTGCCGCTCCTGAGAGGCAGCCATCCTGGGAACGCAGCCCCG CGGCGCCC 597 

II I III II III Illllll II lllll I Mill I I I MM 

Db 42 9 CCCGTCGCCCCGGAGCGGCAGCCGTCTTGGGACCCGAGCCCGGTGTCGTCGACCGTGCCC 48 8 

Qy 5 98 GCGCCATCCCTGCCGCCCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAGAGGACGACGAG 657 

llllll II I I 1 1 II I M I II 1 1 MMM MMM MMM I 

Db 48 9 GCGCCATCCCCGCTGTCTGCTGCCGCAGTCTCGCCCTCCAAGCTCCCTGAGGACGACGAG 54 8 

Qy 658 CCTCCGGCGAGGCCCCCGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGGCGGAG 711 

IMIMM MM II Mill II II Ml III IMIMM III III 

Db 54 9 CCTCCGGCCCGGCCTCCCCCTCCTCCCCCGGCCAGCGTGAGCCCCCAGGCAGAGCCCGTG 608 

Qy 712 CCCGCCGCGCCCCCTTCCACGCCGGCCGCGCCCAAGCGC 75 0 

I 1 1 ; 1 1 1 1 1 1 1 1 lllll MMM MMM II 

Db 609 TGGACCCCGCCAGCCCCGGCTCCCGCCGCGCCCCCCTCCACCCCGGCCGCGCCCAAGCGC 668 

Qy 751 AGGGGCTCC GGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCT 8 07 

MMM! IMMMIM llllll IIIMIIMMIIIIMM llllll I! 

Db 66 9 AGGGGCTCCTCGGGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCT 72 8 

Qy 8 08 GTGATACC CTC CTCTG CAGAAAAAATTATGGATTTGATGGAG CAG CCAGGTAACACTGTT 867 

Illllll Mill Mill! llllll MM llllllllllillllllll M 

Db 72 9 GTGATACG CTC CTCTG CAGAAAA : TATGGACTTGAAGGAGCAGCCAGGTAACACTATT 785 

Qy 868 TCGTCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCCTCTCTTCCTTCT 927 

III M MMM llllll llllll MMM II MMM MIMM II 

Db 786 TCGGCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCTTCTCTTCCTTCT 845 



Qy 



928 CTATCTCCTCTCTCAACTGTTTCTTTTAAAGAACATGGATACCTTGGTAACTTATCAGCA 987 



II IIIIIIIMIM I I III 1 1 1 1 1 1 1 1 MINIMI II Ml II 

Db 846 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 905 

Qy 988 GTGTC^TCCTCAGAAGGAACAATTGAAGAAACTTTAAATGAAGCTTCTAAAGAGTTGCCA 1047 

II I I II I Mill II Mill I I I 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 I II 

Db 9 06 GTATTACCC^CTGAAGGAAC^CTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTO^ 965 

Qy 1048 GAGAGGG CAACAAATC CATTTGTAAATAGAGATTTAG CAGAATTTTCAGAATTAGAATAT 1107 

1 1 1 1 Mill II II I I II IMIIMMM MM I i 1 1 1 1 1 1 1 1 1 1 1 1 

Db 966 GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 1025 

Qy 1108 TCA.GAAATGGGATCATCTTTTAAAGGCTCCCCAAAAGGAGAGTCAGCCATATTAGTAGAA 1167 

IMM II MMM II I I III Mill III II Ml II MUM I 

Db 1026 T CAGAAATGGGATCAT CGTTCAGTGTCTCTC CAAAAG CAGAATCTGCCGTAATAGTAG CA 1085 
Qy 1168 AACACTAAGGAAGAAGTAATTGTGAGGAGTAAA GACAAAGAGGATTTAGTTTGTAGT 1224 

M III Mill MM MM Mill II Mill Mill Ml I 

Db 1086 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 114 5 
Qy 1225 GCAGCCCTTCACAGTCCACAAGAATCACCT GTGGGTAAAGAAGAC 1269 

INI I II MINI I 1 1 1 1 III Mill II 

Db 1146 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1205 

Qy 1270 AGAGTTGTGTCTCCAGAAAAGACAATGGACATTTTTAATGAAATGCAGATGTCAGTAGTA 1329 

MINIUM 1 1 1 1 1 III MM I i 1 1 ! 1 1 1 i 1 1 I I MM I I 

Db 12 06 GAAGTTGTGTCTT CAGAAAAAG CAAAAGACAGTTTTAATGAAAAGAGAGTTG CAGTGGAA 1265 

Qy 133 0 GCACCTGTGAGGGAAGAGTATGCAGACTTTAAGCCATTTGAACAAGCATGGGAAGTGAAA 138 9 

M III I II II MIMMMII II IMM I II Ml MM 

Db 1266 G CT CCTATGAGGGAGGAATATG CAGACTT CAAACCATTTGAG CGAGTATGGGAAGTGAAA 1325 
Qy 13 90 GATACTTATGAGGGAAGTAGGGATGTG CTGGCTG CTAGAGCT AATGTG 143 7 

MM I Ml I Ml III II MM III I II II 

Db 1326 GATA GTAAGGAAGATAGTGATATGTTGGCTGCTGGAGGTAAAATCGAGAGCAACTTG 1382 

Qy 143 8 GAAAGTAAAGTGGACAGAAAATG CTTGGAAGATAG CCTGGAG CAAAAAAGT CTTGGGAAG 14 97 

IMIMI II I MMM M I MM II II 1 1 1 1 1 I II I II 

Db 1383 GAAAGTAAAGTGGATAAAAAATGTTTTGCAGATAGCCTTGAGCAAACTAATCACGAAAAA 1442 

Qy 14 98 GATAGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAACCTGTGAAGGAC 1557 

I I M I I I I I II I II II III MMMIMMIMM MMM I I I I I I I 
Db 1443 GATAGTGAGAGTAGTAATGATGATACTTCTTT CC C CAGTACG CCAGAAGGTATAAAGGAT 1502 

Qy 1558 AGCTCCAGAGCATATATTACCTGTGCTTCCTTTA CCTCAGCAACCGAAAGCACCACA 1614 

I II IMIMIMI M MMM MMM I IMIMI M MM II 

Db 1503 CGTTCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATTGCA 1562 

Qy 1615 GCAAACACTTTCCCTTTGTTAGAAGATCATACTTCAGAAAATAAAACAGATGAAAAAAAA 1674 

MMM III MIIIIMM Mill I 1 . 1 1 : 1 1 I I || I II II M II II I 

Db 1563 ACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAAAAA 1622 
Qy 1675 ATAGAAGAAAGGAAGGCCCAAATTATAACAGAGAAG ACTAGCCCCAAAACGTCAAAT 1731 

IMIMIMI IMM Ml MM MIMMMII 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1623 ATAGAAGAAAAGAAGGCCCAAATAGTAACAGAGAAGAATACTAGCACCAAAACATCAAAC 1682 

Qy 1732 CCTTTCCTTGTAGCAGTACAGGATTCTGAGGCAGATTATGTTACAACAGATACCTTATCA 1791 

Mill MIIIIMM 1 1 ! 1 1 ! 1 1 1 ! 1 1 1 MIIIIMM IMIMIMI III II 



Db 


1683 


CCTTTTCTTGTAGOVGC^CAGGATTCTGAGACAGATTATGTC^CAACAGATAATTTAACA 


1742 


Qy 


1792 


AAGGTGACTGAGGCAGCAGTGTCAAACATGCCTGAAGGTCTGACGCCAGATTTAGTTCAG 

IMMIMMIII II III IIIIIIIIIIIIIIII Mill IIIIIIIIIII III 

AAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTACAG 


1851 


Db 


1743 


1802 


Qy 


1852 


GAAGC^TGTGAAAGTGAACTGAATGAAGCCACAGGTAOWVGATTGCTTATGAAACAAAA 

IIIMIIIMIMIIIII lllllllll II lllllllllllllllllllllllllll 

GAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAACAAAA 


1911 


Db 


1803 


1862 


Qy 


1912 


GTGGACTTGGTCCAAACATCAGAAGCTATACAAGAATCACTTTACCCCACAGCACAGCTT 
i i i i i i i i i i i i i i i i i i i i i i i iii i i t i i i i i i i ii ii iiiiiiiiiti 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 Mill II II 1 1 1 II 1 1 1 1 1 1 

ATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAGCTT 


1971 


Db 


1863 


1922 


Qy 


1972 


TGCCCATCATTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATG 
iiiiiiiiiiitii ii i i i i i t t t i i i i i i i i i i i i i i i i i i i i iiiiiiiii 

1 1 M 1 1 1 1 1 1 II II II 1 1 II 1! Mill 1 1 1 II 1 1 II M M 1 1 1 1 lllllllll 

TGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTTATG 


2031 


Db 


1923 


1982 


Qy 


2032 


GAAGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTA 


2091 


Db 


1983 


IIIIIIIIIII Mill MM II MIIMIMM II 1 MIMIII 1 

GAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGCTCA 


2042 


Qy 


2092 


TCCCCACTGGAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCTTGAGCCTGAAAAC 


2151 


Db 


2043 


II III 1 MM II II llllll 1 M Mill 1 ! : 1 1 

TCACCATTAGAAG CTTCTTCAGTTAATTATGAAAGCATAAAACATGAGCCTGAAAAC 


2099 


Qy 


2152 


C CCC CAC CATATGAAGAAGC CATGAATGTAG CACT AAAAGCTTTGGGAACAAAGGAA 

1 1 II 1 1 1 1 1 1 M 1 1 1 1 1 MMMI MM MM Mill 1 MM MMMI 

CCCCC^CC^TATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATC^GGAATAAAGGAA 


2208 


Db 


2100 


2159 


Qy 


2209 


GGAATAAAAGAGC CTGAAAGTTTTAATG CAG CTGTTCAGGAAACAGAAGCT C CTTATATA 

1 III I'M 1 M 1 1 II II 1 II MM 1. 1 Ml 1 ! 

GAAATTAAAGAGCCTGAAAATATTAATGCAGCTCTTCAAGAAACAGAAGCTCCTTATATA 


2268 


Db 


2160 


2219 


Qy 


2269 


TCCATTGCGTGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGCCAAGTCCAGATTTC 

II Mill MIMMMMMMIMMMIMI II MM III III llllll 

TCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGATTTC 


2328 


Db 


2220 


2279 


Qy 


2329 


T CTAATTATTCAGAAATAGCAAAATTCGAGAAGTCGGTG CC CGAACACG CTGAG CTAGTG 

Ml Mill Ml II III MUM I II II 1 Mill M II II MM MM 

TCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGCTAGTT 


2388 


Db 


2280 


2339 


Qy 


2389 


GAGGATTCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAA 

II Illlllllllllll 1 1 Ml i M III 1 1 || Mill 

GAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCTGAC 


2448 


Db 


2340 


2399 


Qy 


2449 


GTCCCACAAACACAAGAGGAGGCTGTGATGCTCATGAAGGAGAGTCTGACTGA A 

II Mill llllll II MIMIIMI MM II IIIIIIIIIII 1 

GTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACTTCA 


2502 


Db 


2400 


2459 


Qy 


2503 


GTGT CTGAGACAGTAGCCCAG CACAAAGAGGAGAGACTTAGTGC CTCAC CTCAGGAG CTA 

1 1 III 1 1 M MM 1 Ml Mill 1 II 1 III 1 

TTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTTGCCACCTGAGGGA 


2562 


Db 


2460 


2519 


Qy 


2563 


GGAAAGCCATATTTAGAGTCTTTTCAGCCCAATTTACATAGTACAAAAGATGC TGCA 

Ml 1 II M llllll III II MM III lllllllll 1 1 

GGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAAAGATACCCTGTTA 


2619 


Db 


2520 


2579 



Qy 2620 TCTAATGACATTCCAACAT^ 2679 

II 1 1 1 1 'II MINIMI Ml II llllll I IIIIIM MMI III I 

Db 2 580 CCTGATGAAGTTTCAACATTGAG CAAAAAGGAGAAAATT C CTTTGCAGATGGAGGAGCT C 263 9 

Qy 268 0 AATACTGCAATTTATTCAAATGATGACTTACTTTCTTCTAAGGAAGACAAAATAAAAGAA 273 9 

I IIIMII I II HUM III I II llllllll I MM MM 

Db 2 64 0 AGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATAAGAGAA 269 9 

Qy 274 0 AGTGAAACATTTTCAGATTCATCTCCGATTGAGATAATAGATGAATTTCCCACGTTTGTC 27 9 9 

I MUM IIIIIIMMII Mill II llllllll II II II M II 

Db 2700 ACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACATTGATC 2759 
Qy 2800 AGTGCTAAAGATGATTC TCCTAAATTAGCCAAGGAGTACACTGATCTAGAAGTATCC 28 56 

Mi Mill llllll I IMMMIMM III M Mill llllllllllll 

Db 2760 AGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAAGTATCC 2819 

Qy 2857 GACAAAAGTGAAATTGCTAATATCCAAAGCGGGGCAGATTCATTGCCTTGCTTAGAATTG 2916 

1 : 1 1 . 1 1 1 1 1 1 . 1 1 1 1 1 1 1 II II II I MMMMMM IIIMII 

Db 282 0 CACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACAGAATTG 28 7 9 
Qy 2 917 CCCTGTGACCTTTCTTTCAAGAATATATATCCTAAAGATGAAG TACATGTTTCA 2970 

III llllllllllll Mill III I II MM Mill I I I III 

Db 2880 CCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGTTTCTCA 293 9 

Qy 2 971 GATGAATTCTCCGAAAATAGGT C CAGTGTATCTAAGG CAT CCATAT CGCCTTCAAATGT C 3030 

Mill II II Mill MM I III MM I III MM II MM 

Db 2 94 0 GATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCAGATGTT 2 9 9 9 

Qy 3031 TCTGCTTTGGAACCTCAGACAGAAATGGGCAGC^TAGTTAAATCCAAATC^CTTACGAAA 3090 

MM MM MM MM II I llllllllllll Mill III MM 

Db 3 000 TCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTTGTGAAA 305 9 

Qy 3 091 GAAGCAGAGAAAAAACTTCCTTCTGACACAGAGAAAGAGGACAGATCCCTGTCAGCTGTA 3150 

Mill IIIIIIIIIIIIIIIII II Mill IMIMMMMM I II III M 

Db 3 060 GAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCTGCTATA 3119 

Qy 3151 TTGTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAG 3210 

II 1 1 1 1 1 1 1 1 1 . 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 M I ; : 1 1 1 1 MM MMMMMM 

Db 3120 TTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGACATTAAG 317 9 

Qy 3211 AAGACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGC 32 7 0 

lllllllllllllllllllllllllll lllllllllllll II IIIMII llllll 

Db 3180 AAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTATTCAGC 323 9 

Qy 32 71 ATTGTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGG 333 0 

Mill II Mill MMMMMMMIMMMIMM Mill MMMMMM 

Db 324 0 ATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGCTTTAGG 32 9 9 

Qy 3331 ATATATAAGGGCGTGATCCAGGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGGCA 33 90 

Mill Mill I i i 1 1 1 1 i 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! I ! 1 1 M 1 1 1 1 1 1 

Db 3300 ATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGGCA 3359 

Qy 33 91 TATTTAGAATCTGAAGTTGCTATATCAGAGGAATTGGTTCAGAAATACAGTAATTCTGCT 3450 

Ml I Illlllllllllllllllll Mill lllllllllll IIIIIIMIIIII II 

Db 33 60 TAT CTGGAAT CTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAATT CTG CT 3419 



Qy 3451 CTTGGTCATGTGAACAGCACAATAAAAGAACTGAGGCGGCTTTTCTTAGTTGATGATTTA 3 510 

IIIIIIIIMIIIII 1 1 1 1 Mill Mill Mill II I ! II I : I I I 

Db 342 0 CTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGATGATTTA 3479 

Qy 3511 GTTGATTCCCTGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTC 3 570 

IIIMIII MMMMMMMMMMIIMM Mill MMIMMIMIMM 

Db 348 0 GTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTT 3 53 9 

Qy 3571 AATGGTCTGACACTACTGATTTTAGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTAT 3 630 

IN MMIIII! Ilhll Mill M MM II III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 354 0 AATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTTATTTAT 3 599 

Qy 3631 GAA CGG CAT CAGGTG CAGATAGAT CATTAT CTAGGACTTG CAAA CAAGAGTGTTAAGGAT 3690 

Illllllllllil Mill II Ml II MMM MM 1 1 1 1 1 1 1 1 1 1 III 

Db 360 0 GAACGG CAT CAGG CACAGATAGAT CATTATCTAGGACTTG CAAATAAGAATGTTAAAGAT 3659 

Qy 3691 GCCATGGCCAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGa^GA 3 74 0 

II Mill MMMMMMIMMIMMMIIMMMMIMI II 

Db 3660 G CTATGG CTAAAAT C CAAG CAAAAATC C CTGGATTGAAG CG CAAAG CTGA 3 709 
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DR P-PSDB; ABP68600. 
XX 

PT New isolated polynucleotide and pancreatic tumor polypeptides, useful 

PT for diagnosing, preventing and/or treating cancer, particularly 

PT pancreatic cancer 
XX 

PS Claim 1; SEQ ID NO 53; 300pp + Sequence Listing; English. 
XX 

CC The invention relates to an isolated polynucleotide (I) comprising: (a) 

CC any of a group of over 4000 nucleotide sequences (ABV94628-ABV99145) ; 

CC (b) complements of (a) ; (c) sequences consisting of at least 20 

CC contiguous residues of (a) ; (d) sequences that hybridize to (a) , under 

CC moderately stringent conditions; (e) sequences having at least 75% or 90% 

CC identity to (a) ; or (f ) degenerate variants of (a) . Polypeptides 

CC (ABP68596-ABP68637) encoded by (I) and oligonucleotide can be used to 

CC detect cancer in a patient and compositions comprising polypeptides, 

CC polynucleotides, antibodies, fusion proteins, T cell populations and 

CC antigen presenting cells expressing the polypeptide are useful in 

CC treating pancreatic cancer and stimulating an immune response. The 

CC polynucleotides can be used as probes or primers for nucleic acid 

CC hybridisation, in the design and preparation of ribozyme molecules for 

CC inhibiting expression of the tumour polypeptides and proteins in the 

CC tumour cells, in vaccines and for gene therapy. 

CC Note: The sequence data for this patent did not form part of the printed 

CC specification, but was obtained in electronic format directly from WIPO 

CC at ftp.wipo. int/pub/published_pct_sequences . 
XX 

SQ Sequence 4632 BP; 1398 A; 1013 C; 1011 G; 1210 T; 0 other; 

Query Match 62.6%; Score 2343.6; DB 24; Length 4632; 

Best Local Similarity 81.3%; Pred. No. 0; 

Matches 3017; Conservative 0; Mismatches 574; Indels 119; Gaps 21; 
Qy 134 CACGACTCGGCCTGCCTGGCCCCTGCCAGTCTTGCCCAACCCCCACAACCGCCCGCGACT 193 

I II III I I III 1 1 1 1 1 1 Mill I llllilll MINIMI I II 

Db 23 CTCGGCTCAGTCGGCCCAGCCCCTCTCAGTCCTCCCCAACCCCC^CAACCGCCCGCGGCT 82 

Qy 194 CTGAGGAGAAGCGGC - CCTGCGGCGGCTGTAGCTGCAGCATCGTCGGCGACCCGCCAGCC 252 

Mill I II I I Mill I I [ 1 1 1 1 1 1 1 1 1 E II I MM MUM 

Db 83 CTGAGACGCGGCCCCGGCGGCGGCGGCAGCAGCTGCAGCATCATC - TCCACCCTCCAGCC 141 

Qy 253 ATGGAAGACATAGACCAGTCGTCGCTGGTCTCCTCGTCCACGGACAGCCCGCCCCGGCCT 312 

1 1 1 1 1 1 1 1 1 I MM M I Mill I Ml Illllll llllllll 

Db 142 ATGGAAGACCTGGACCAGTCTCCTCTGGT CTCGTCCTCGGACAGCCCACCCCGGCCG 198 

Qy 313 CCGCCCGCCTTCAAGTACCAGTTCGTGACGGAGCCCGAGGACGAGGAGGACGAGGAGGAG 372 

I 1 1 1 1 1 1 IMMMMIMMIMM M , M II Ml 

Db 199 CAGCCCGCGTTCAAGTACCAGTTCGTGAGGGAGCCCGAGGACGAGGAG GAAGAAGAG 255 

Qy 373 GAGGAGGACGAGGAGGAGGACGACGAGGACCTAGAGGAACTGGAGGTGCTGGAGAGGAAG 432 

llllllll llllllll II Mill Mill Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 

Db 256 GAGGAGGAAGAGGAGGACGAGGACGAAGAC CTGGAGGAGCTGGAGGTG CTGGAGAGGAAG 315 
Qy 433 CCCGCAGCCGGGCTGTCCGCAGCTGCGGTGC CGCCCGCCGCCGCCGCGCCGCTG 486 

Mill MMMMMIMI I! I MM I II Illllll llllll Ml 

Db 316 CCCGCCGCCGGGCTGTCCGCGGCCCCAGTGCCCACCGCCCCTGCCGCCGGCGCGCCCCTG 3 75 



Qy 487 CTGGACTTCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGCCGGCCGCGCCC 54 6 

MINI I I 1 1 1 1 III II MINI II II III II III Ml 

Db 376 ATGGACTTCGGAAATGACTTCGTGCCGCCGGCGCCCCGGGGACCCCTGCCGGCCGCTCCC 435 
Qy 54 7 CCTGCCGCTCCTGAGAGGCAGCCATCCTGGGAACGCAGCCCCG CGGCGCCC 5 97 

II I III II III MM II Mill I Mill I I I MM 

Db 436 CCCGTCGCCCCGGAGCGGCAGCCGTCTTGGGACCCGAGCCCGGTGTCGTCGACCGTGCCC 4 95 

Qy 598 GCGCCATCCCTGCCGCCCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAGAGGACGACGAG 657 

MMMMM II I I MMMMMM MMMMMM MMMMMM 

Db 4 96 GCGCCATCCCCGCTGTCTGCTGCCGCAGTCTCGCCCTCCAAGCTCCCTGAGGACGACGAG 555 
Qy 658 CCTCCGGCGAGGCCCCCGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGGCGGAG 711 

MIMIII MM II Mill II II III III II Ml III III 

Db 556 CCTCCGGCCCGGCCTCCCCCTCCTCCCCCGGCCAGCGTGAGCCCCCAGGCAGAGCCCGTG 615 
Qy 712 CCCGCCGCGCCCCCTTCCACGCCGGCCGCGCCCAAGCGC 750 

IMMMMIMM Mill 

Db 616 TGGACCCCGCCAGCCCCGGCTCCCGCCGCGCCCCCCTCCACCCCGGCCGCGCCCAAGCGC 675 

Qy 751 AGGGGCTCC GGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCT 8 07 

Mill I I : I I I I I I I I ! I I I I I I I I I I I I I I I I I I ! I I I I I I I ; I 

Db 676 AGGGGCTCCTCGGGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCT 735 

Qy 8 08 GTGATACCCTCCTCTGCAGAAAAAATTATGGATTTGATGGAGCAGCCAGGTAACACTGTT 867 

MINI IIIIIIMIIIMM MINI 1 1 1 1 MINI MMI MINIM I 

Db 73 6 GTGATA CG CT C CT CTG CAGAAAA TATGGACTTGAAGGAG CAG CCAGGTAACACTATT 7 92 

Qy 8 68 TCGTCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCCTCTCTTCCTTCT 927 

III II II MUM MUM IIMIMIIIMI IUIM MIMIII 

Db 793 TCGGCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCTTCTCTTCCTTCT 8 52 

Qy 92 8 CTATCTCCTCTCTCAACTGTTTCTTTTAAAGAACATGGATACCTTGGTAACTTATCAGCA 987 

M MMMMMM I I MMM MMMMM MMMMMM M Ml II 

Db 853 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 912 

Qy 988 GTGTCATCCTCAGAAGGAACAATTGAAGAAACTTTAAATGAAGCTTCTAAAGAGTTGCCA 104 7 

II I I II I IIIMIMI II MMM I I I MIMMIMMMM I II 

Db 913 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 972 

Qy 1048 GAGAGGGCAACAAATCCATTTGTAAATAGAGATTTAGCAGAATTTTCAGAATTAGAATAT 1107 

MM Mill II II I I II Ml. MM Ml I i I I 

Db 973 GAGAAGG CAAAAACT CTACT CATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 1032 

Qy 1108 TCAGAAATGGGATCATCTTTTAAAGGCT C C CCAAAAGGAGAGTCAG CCATATTAGTAGAA 1167 

IIIIIIIIIIIIIIII II I I Ml IIIMII III II III II MMM I 

Db 1033 TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 1092 
Qy 1168 AACACTAAGGAAGAAGTAATTGTGAGGAGTAAA GACAAAGAGGATTTAGTTTGTAGT 1224 

II Ml IIIMII MM MM I MM II Mill I MMM Ml I 

Db 1093 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1152 
Qy 122 5 GCAGCCCTTCACAGTCCACAAGAATCACCT GTGGGTAAAGAAGAC 1269 

MMM i II MMM I MM III MMM II 

Db 1153 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1212 



Qy 



127 0 AGAGTTGTGTCTCCAGAAAAGACAATGGACATTTTTAATGAAATGCAGATGTCAGTAGTA 132 9 



1 1 1 1 I I 

CAGTGGAA 1272 



Db 


1213 


GAAGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTGGAA 


1272 


Qy 


1330 


GCACCTGTGAGGGAAGAGTATGCAGACTTTAAGCCATTTGAACAAGCATGGGAAGTGAAA 

II III 1 II IIMIIIIMI II IMMMI 1 II Mill, 

G CTC CTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAG CGAGTATGGGAAGTGAAA 


1389 


Db 


1273 


1332 


Qy 


1390 


GATACTTATGAGGGAAGTAGGGATGTG CTGGCTG CTAGAG CT - - AATGTG 

MM 1 Ml 1 Ml III II MINI. Ill 1 II II 

GATA GTAAGGAAGATAGTGATATGTTGGCTGCTGGAGGTAAAATCGAGAGCAACTTG 


1437 


Db 


1333 


1389 


Qy 


1438 


GAAAGTAAAGTGGACAGAAAATG CTTGGAAGATAG CCTGGAG CAAAAAAGTCTTGGGAAG 

1 1 1 1 1 1 1 1 1 1 M ! ! 1 llllll II 1 lllllllll lllllll 1 M 1 II 

GAAAGTAAAGTGGATAAAAAATGTTTTG CAGATAG CCTTGAG CAAACTAATCACGAAAAA 


1497 


Db 


1390 


14 4 9 


Qy 


1498 


GATAGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAACCTGTGAAGGAC 

IMMMI 1 II Mill Ml MIMMIMMMM MUM 1 1 Mill 

GATAGTGAGAGTAGTAATGATGATACTTCTTTCCCCAGTACGCCAGAAGGTATAAAGGAT 


1557 


Db 


1450 


1509 


Qy 


1558 


AGCTCCAGAGCATATATTACCTGTGCTTCCTTTA CCTCAGCAACCGAAAGCACCACA 


1614 


Db 


1510 


1 II llllllllll II llllll llllll 1 MIMM II MM II 

CGTTCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATTGCA 


1569 


Qy 


1615 


GCAAACACTTTCCCTTTGTTAGAAGATCATACTTCAGAAAATAAAACAGATGAAAAAAAA 


1674 


Db 


1570 


llllll III llllllllll Mill lllllllllllllll II IMIIIIIIIII 

ACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAAAAA 


1629 


Qy 


1675 


ATAGAAGAAAGGAAGGCCCAAATTATAACAGAGAAG ACTAGCCCCAAAACGTCAAAT 

llllllllll llllllllllll lllllllllll llllll lllllll Mill 

ATAGAAGAAAAGAAGG C C CAAATAGTAA CAGAGAAGAATA C TAG CA C CAAAA CAT CAAA C 


1731 


Db 


1630 


1689 


Qy 


1732 


CCTTTCCTTGTAGCAGTACAGGATTCTGAGGCAGATTATGTTACAACAGATACCTTATCA 

Mill IIMIIMM 'II llllllllll llllllllll III II 

CCTTTTCTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAACAGATAATTTAACA 


1791 


Db 


1690 


1749 


Qy 


1792 


AAGGTGACTGAGGCAGCAGTGTCAAACATGCCTGAAGGTCTGACGCCAGATTTAGTTCAG 

II 1 II 1 II 1 II 1 1 II III llllllllllllllll Mill IMMIIMM Ml 

AAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTACAG 


1851 


Db 


1750 


1809 


Qy 


1852 


GAAG CATGTGAAAGTGAACTGAATGAAG CCACAGGTACAAAGATTG CTTATGAAA CAAAA 

MMMMMIMIMM IIMIIMI II MMMMMMMMMMMIMM 

GAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAACAAAA 


1911 


Db 


1810 


1869 


Qy 


1912 


GTGGACTTGGTCCAAACATCAGAAGCTATACAAGAATCA 

MMIIIMI MIMMMMM III Mill Mill M II IMIIIMIII 

ATGGACTTGGTTCAAACAT(^GAAGTTATGCAAGAGT(^CTCTATCCTGCAGCACAGCTT 


1971 


Db 


1870 


1929 


Qy 


1972 


TGCCCATCATTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATG 

i Ml. II 1 Mill Mill hl.lJil lllllllll 

TGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTTATG 


2031 


Db 


1930 


1989 


Qy 


2032 


GAAGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTA 


2091 


Db 


1990 


IMMIIMM MUM MM II II 1 1 M 1 II 1 1 II 1 MIIIMI 1 

GAAGC^CCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGCTCA 


2049 


Qy 


2092 


TCCCCACTGGAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCTTGAGCCTGAAAAC 


2151 



II II llllll llllll II Mill I 1 1 1 1 1 1 1 1 1 1 M 



Db 



2 050 TCACCATTAGAAG- - 



- CTTCTTCAGTTAATTATGAAAG CATAAAACATGAGC CTGAAAAC 2106 



Qy 


2152 


CCCCCACCATATGAAGAAGCCATGAATGTAGCACT- - -AAAAGCTTTGGGAACAAAGGAA 

1 1 1 1 1 l l l l l l l l l l l l l l l 1 l l l llll till l l l l : i i i i i t i > i i i i 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 | i 1 1 1 

CCCCCACCATATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATCAGGAATAAAGGAA 


2208 


Db 


2107 


2166 


Qy 


2209 


GGAATAAAAGAGCCTGAAAGTTTTAATG CAG CTGTTCAGGAAACAGAAG CTCCTTATATA 
i iii i i i i i i < i i i i i i i i i i i i i i i i i i iiii i i i i i i i i i i i i i i i i i i i i i 

1 III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 llll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GAAATTAAAGAGCCTGAAAATATTAATG CAG CTCTTCAAGAAACAGAAG CTCCTTATATA 


2268 


Db 


2167 


2226 


Qy 


2269 


TCCTVTTGCGTGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGCO^GTCCAGATTTC 
it i i i i i i i i i i i i i i i i t i i i i i t i i i i i i t i ii iiii iii lit 

1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 

TCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGATTTC 


2328 


Db 


2227 


2286 


Qy 


2329 


TCTAATTATTCAGAAATAGCAAAATTCGAGAAGTCGGTGCCCGAACACGCTGAGCTAGTG 
iii i i i i i i i i i i i i i t i i i i i i ii ii i t i i i i ii ii i i i i i t i i i i 


2388 


Db 


2287 


1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TCTGATTATTCAGAAATGGCAAAAGTTGAACAGC CAGTG C CTGATCATT CTGAG CTAGTT 


2346 


Qy 


2389 


GAGGATTCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAA 
ii i i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i t ■ i i i i i i ii i i i i i 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 ! 1 1 1 1 1 1 II 1 1 1 1 1 

GAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCTGAC 


2448 


Db 


2347 


2406 


Qy 


2449 


GTCCCACAAACAC^AGAGGAGGCTGTGATGCTCATGAAGGAGAGTCTCACTGA A 
ii i i i i i i i i i i i i i ii i i i t i i i i i t iiii ii i i i i i i i i i i i i 

1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 

GTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACTTCA 


2502 


Db 


2407 


2466 


Qy 


2503 


GTGTCTGAGACAGTAGCCCAGCACAAAGAGGAGAGACTTAGTGCCTCACCTCAGGAGCTA 
i i iii i iii i i i i i i i i i i i i i i iii iii i 


2562 


Db 


2467 


1 1 III 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 III Ml I 

TTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTTG C CAC CTGAGGGA 


2526 


Qy 


2563 


GGAAAGCCATATTTAGAGTCTTTTCAGCCCAATTTACATAGTACAAAAGATGC TGCA 

i i i i i i i i i i i i i i ii i i i i i i iii ii iiii iii i i i i i i i i i i i 


2619 


Db 


2527 


1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GGAAAG C CATATTTGGAATCTTTTAAG CTCAGTTTAGATAACACAAAAGATACC CTGTTA 


2586 


Qy 


2620 


TCTAATGACATTCCAACATTGACCAAAAAGGAGA 

II llll II lllliiit i i i i i t i i i i i i i i i i i iiiiiii itiii iii i 

II MM II lllllllll IMIMIIIIIIIIII IIIIIII Mill III 1 

CCTGATGAAGTTTCAA(^TTGAG(^W\AAGGAGAAAATTCCTTTGCAGATGGAGGAGCTC 


2679 


Db 


2587 


2646 


Qy 


2680 


AATACTGCAATTTATTCAAATGATGACTTACTTTCTTCTAAGGAAGACAAAATAAAAGAA 

1 IIIIIII 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 \ 1 1 1 1 II 1 1 1 1 1 1 1 1 1 i i i iiii iiii 


2739 


Db 


2647 


M l l M M l M l l M l l l l l l l l M l M l l l M l l M M M l l l M l M l 

AGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATAAGAGAA 


2706 


Qy 


2740 


AGTGAAACATTTTCAGATTCATCTCCGATTGAGATAATAGATGAATTTCCCACGTTTGTC 

1 llllll 1 1 1 1 t 1 1 1 1 1 1 1 1 1 t I 1 lllll il iiiiiiii ii ii ii ii ii 

1 llllll IMIIIIIIIIMMII lllll II IIIIIIII II II II II II 

ACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACATTGATC 


2799 


Db 


2707 


2766 


Qy 


2800 


AGTG CTAAAGATGATTC TCCTAAATTAGCCAAGGAGTACACTGATCTAGAAGTATCC 

III lllll llllll 1 llllllllllll III II Mill IIIIIIMIIII 

AGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAAGTATCC 


2856 


Db 


2767 


2826 


Qy 


2857 


GACAAAAGTGAAATTGCTAATATCCAAAGCGGGGCAGATTCATTGCCTTGCTTAGAATTG 

MIIIMI llllli II III II II || I llllllllllll Mlllll 

CACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACAGAATTG 


2916 


Db 


2827 


2886 


Qy 


2917 


CCCTGTGACCTTTCTTTCAAGAATATATATCCTAAAGATGAAG TACATGTTTCA 

III llllllllllll lllll III I II llll lllll I l 1 III 

CCCCATGACCTTTCTTTGAAGAACATACAACCOW^GTTGAAGAGAAAATCAGTTTCTCA 


2970 


Db 


2887 


2946 



Qy 2 971 GATGAATTCTCCGAAAATAGGTCCAGTGTATCTAAGGCATCCATATCGCCTTCAAATGTC 3 030 

Mill II II Mill MM I Ml MM I III MM II MM 

Db 2 947 GATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCAGATGTT 3 006 

Qy 3 031 TCTGCTTTGGAACCTCAGACAGAAATGGGCAG 3 090 

MMIIIMI MM MM II I 1 f I ! 1 1 f 1 1 1 1 1 Mill Ml MM 

Db 3 007 TCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTTGTGAAA 3 066 

Qy 3 091 GAAGCAGAGAAAAAACTTCCTTCTGACACAGAGAAAGAGGACAGATCCCTGTCAGCTGTA 3150 

Mill 1 1 1 1 1 ! 1 1 1 i 1 1 1 1 II Mill 1 1 1 ; 1 1 1 1 1 1 1 1 1 I II III II 

Db 3067 GAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCTGCTATA 3126 

Qy 3151 TTGTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAG 3210 

II MMMMMMMMMMMMMMMMIIMM 1 1 M ill! 1 1 1 1 1 1 1 1 

Db 3127 TTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGACATTAAG 3186 

Qy 3211 AAGACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGC 3270 

I ;l Mill; MINI II MM MMMMIMM II Ml Ml I 1 1 III I 

Db 3187 AAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTATTCAGC 3246 

Qy 3271 ATTGTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGG 333 0 

Mill II Mill III M IIMII llllllllll Mill 1 1 j I ! 1 1 1 [ 1 1 

Db 3247 ATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGCTTTAGG 33 06 

Qy 3331 ATATATAAGGG CGTGATCCAGG CTAT CCAGAAATCAGATGAAGG CCACC CATT CAGGGCA 3390 

Mill Mill II IIMII IIMII III III Mill III II Ihll! Mill 

Db 33 07 ATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGGCA 33 66 

Qy 33 91 TATTTAGAATCTGAAGTTGCTATATCAGAGGAATTGGTTCAGAAATACAGTAATTCTGCT 34 50 

III I IIIMIIIIIIIIIIIIIII Mill MMIMIMI Mill 

Db 3367 TATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAATTCTGCT 3426 

Qy 3451 CTTGGTCATGTGAACAGCACAATAAAAGAACTGAGGCGGCTTTTCTTAGTTGATGATTTA 3510 

I IM llllli I MM Mill MM! Mill II II II II 1 1 II 1 1 1 1 II 1 1 

Db 3427 CTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGATGATTTA 3486 

Qy 3511 GTTGATTCCCTG7VAGTTTGCAGTGTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTC 3570 

I M 1 1 1 1 1 M III Mill MINI MINIMI Mill 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 

Db 34 87 GTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTT 354 6 

Qy 3571 AATGGTCTGACACTACTGATTTTAGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTAT 363 0 

II MMIMIMI IIMII II Mill II 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 i 1 1 1 i ' 

Db 3 547 AATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTTATTTAT 3606 

Qy 3 631 GAACGG CATCAGGTG CAGATAGAT CATTATCTAGGACTTG CAAACAAGAGTGTTAAGGAT 3690 

MMMMIMM II MMIMIMI MIMIMIMM II MM IIMII III 

Db 3 607 GAACGG CATCAGG CACAGATAGATCATTATCTAGGACTTGCAAATAAGAATGTTAAAGAT 3666 

Qy 3691 G CCATGG CCAAAATC CAAG CAAAAATC C CTGGATTGAAG CG CAAAG CAGA 374 0 

II Mill MMMMMMMMMMMMMMMMIIMM II 

Db 3 667 GCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGA 3716 
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New secreted or transmembrane proteins and polynucleotides encoding 


PT 


them, useful for treating neurodegenerative disorders, autoimmune 


PT 


diseases and cancer - 


XX 






PS 


Claim 72; Page 321-322; 357pp ; English. 


XX 






CC 


The invention relates to 40 human secreted proteins (AAY94 981 -Y95020 ) , 


CC 


and cDNA sequences encoding them (AAA23423 -A23462 ) . The secreted 


CC 


proteins of 


the invention include those that are thought to be only 


CC 


partially secreted, i.e., transmembrane proteins. The proteins of the 


CC 


invention may exhibit one or more activities selected from the following 


CC 


cytokine activity; cell proliferation; differentiation; immune 


CC 


modulation; 


haematopoiesis regulation; tissue growth activity; 



CC activin/inhibin activity; chemotact ic/chemokinetic activity; haemostatic 

CC and thrombolytic activity; ant i- inflammatory activity; and tumour 

CC inhibition activity. The proteins may be administered to patients as 

CC vaccines, and the nucleotides may be used as part of a gene therapy 

CC regime. Diseases or conditions that may be treated using the proteins or 

CC nucleotides of the invention include autoimmune diseases; genetic 

CC disorders; haemophilia; cardiovascular diseases; cancer; bacterial, 

CC fungal and viral infections, especially HIV; multiple sclerosis; 

CC rheumatoid arthritis; pulmonary inflammation; Guillain-Barre syndrome; 

CC insulin dependent diabetes mellitus; and allergic reactions such as 

CC asthma and anaemia. They may also be used for treating wounds, burns, 

CC ulcers, osteoporosis, osteoarthritis, periodontal diseases, Alzheimer's 

CC disease, Parkinson's disease, Huntington's disease and amyotrophic 

CC lateral sclerosis (ALS) . Proteins with activin/inhibin activity may 

CC additionally be useful as contraceptives. Nucleic acid sequences of the 

CC invention may be used in chromosome mapping, and as a source of 

CC diagnostic primers and probes. The present sequence represents cDNA 

CC encoding one of the 4 0 proteins of the invention. 

XX 

SQ Sequence 4093 BP; 1213 A; 926 C; 928 G; 1026 T; 0 other; 

Query Match 62.4%; Score 2333.2; DB 21; Length 4093; 

Best Local Similarity 81.3%; Pred. No. 0; 

Matches 3017; Conservative 0; Mismatches 573; Indels 120; Gaps 22; 
Qy 134 (^CGACTCGGCCTGCCTGGCCCCTGCC^GTCTTGCCC^CCCCCAC^ACCGCCCGCGACT 193 

I II III I I III IMIII Mill I mill MINIMI INN II 

Db 33 CTCGGCTCAGTCGGCCCAGCCCCTCTCAGTCCTCCCCAACCCCCACAACCGCCCGCGGCT 92 

Qy 194 CTGAGGAGAAGCGGC-CCTGCGGCGGCTGTAGCTGCAGCATCGTCGGCGACCCGCCAGCC 252 

Mill I II I I Mill I II 1 1 II I II 1 1 1 II I MM MUM 

Db 93 CTGAGACGCGGCCCCGGCGGCGGCGG(^GC^GCTG(^G(^T(^TC-TCa^CCCTCCAGCC 151 

Qy 253 ATGGAAGACATAGACCAGTCGTCGCTGGTCTCCTCGTCCACGGACAGCCCGCCCCGGCCT 312 

I III M I I II I M II I Mill MM Ml MUM MINI 

Db 152 ATGGAAGACCTGGACCAGTCTCCTCTGGT CTCGTCCTCGGACAGCCCACCCCGGCCG 208 

Qy 313 CCGCCCGCCTTCAAGTACCAGTTCGTGACGGAGCCCGAGGACGAGGAGGACGAGGAGGAG 372 

I MUM MUM IMIII II III IIIMIMI Ml II II III 

Db 2 09 CAGCCCGCGTTCAAGTACCAGTTCGTGAGGGAGCCCGAGGACGAGGAG GAAGAAGAG 265 

Qy 3 73 GAGGAGGACGAGGAGGAGGACGACGAGGACCTAGAGGAACTGGAGGTGCTGGAGAGGAAG 432 

Mill 1 1 II II 1 1 II Mill Mill Mill IMIIIMI IIIIIMI 

Db 2 66 GAGGAGGAAGAGGAGGACGAGGACGAAGACCTGGAGGAGCTGGAGGTGCTGGAGAGGAAG 325 
Qy 433 CCCG CAG CCGGG CTGT CCG CAG CTG CGGTG C CGCCCGCCGCCGCCGCGCCGCTG 486 

Mill 1 1 1 1 1 1 E I i 1 1 1 1 1 1 1 I MM I II Mill IMIII III 

Db 326 CCCGCCGCCGGGCTGTCCGCGGCCCCAGTGCCCACCGCCCCTGCCGCCGGCGCGCCCCTG 385 

Qy 487 CTGGACTTCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGCCGGCCGCGCCC 54 6 

IIIIIMI I I MM Mill II IMIII M II IMIIMMM III 

Db 386 ATGGACTTCGGAAATGACTTCGTGCCGCCGGCGCCCCGGGGACCCCTGCCGGCCGCTCCC *44 5 
Qy 54 7 CCTG C CG CTC CTGAGAGG CAG CCATC CTGGGAACG CAG CCC CG CGGCGCCC 597 

M I III M III Mill II Mill I Mill I I I MM 

Db 44 6 CCCGTCGCCCCGGAGCGGCAGCCGTCTTGGGACCCGAGCCCGGTGTCGTCGACCGTGCCC 505 



Qy 598 GCGCCATCCCTGCCGCCCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAGAGGACGACGAG 657 

Milium ii i i iiiiiiiiiiii iiiiiiiMiiiiii iiiiMiiiiii 

Db 506 GCGCCATCCCCGCTGTCTGCTGCCGCAGTCTCGCCCTCCAAGCTCCCTGAGGACGACGAG 565 

Qy 658 CCTCCGGCGAGGCCCCCGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGGCGGAG 711 

Illlllll 1 1 1 1 II Mill II II III III 1 1 1 1 1 1 1 f III Ml 

Db 566 CCTCCGGCCCGGCCTCCCCCTCCTCCCCCGGCCAGCGTGAGCCCCCAGGCAGAGCCCGTG 625 

Qy 712 CCCGCCGCGCCCCCTTCCACGCCGGCCGCGCCCAAGCGC 750 

1 1 1 1 1 1 1 1 1 1 1 1 i I Mill 1 1 1 1 1 1 1 1 1 1 E 1 1 i E 1 1 1 

Db 626 TGGACCCCGCCAGCCCCGGCTCCCGCCGCGCCCCCCTCCACCCCGGCCGCGCCCAAGCGC 685 

Qy 751 AGGGGCTCC GGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCT 807 

1 1 1 1 M I ! I IIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIII 

Db 686 AGGGGCTCCTCGGGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCT 745 

Qy 8 08 GTGATACCCTCCTCTGCAGAAAAAATTATGGATTTGATGGAGCAGCCAGGTAACACTGTT 867 

IMIIII Ml Ml 1 1 1 1 llllllllllllillllll II 

Db 746 GTGATACG CTCCT CTG CAGAAAA TATGGACTTGAAGGAGCAG CCAGGTAACACTATT 802 

Qy 8 68 TCGTCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCCTCTCTTCCTTCT 927 

III MM IIIIIIIIMII! IMMMMIM llilll III III II llllllllllll 

Db 8 03 TCGGCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCTTCTCTTCCTTCT 862 

Qy 928 CTATCTCCTCTCTCAACTGTTTCTTTTAAAGAACATGGATACCTTGGTAACTTATCAGCA 987 

II llllllllllll I I llilll Mill llllllllllll II III II 

Db 8 63 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA. 922 

Qy 988 GTGTCATCCTCAGAAGGAACAATTGAAGAAACTTTAAATGAAGCTTCTAAAGAGTTGCCA 1047 

II I I Ml Illllllll II MUM I II I Ml III IIIIIMI I II 

Db 923 GTATTACCC^CTGAAGGAA^CTTO^GAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 982 

Qy 1048 GAGAGGGO^CAAATCCATTTGTAAATAGAGATTTAGCAGAATTTTCAGAATTAGAATAT 1107 

MM Mill II II I I M MINIM MM II I M 1 1 1 1 M 1 1 1 1 1 1 

Db 983 GAGAAGG CAA - AACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 1041 

Qy 1108 TCAGAAATGGGATCATCTTTTAAAGGCTCCCCAAAAGGAGAGTCAGCCATATTAGTAGAA 1167 

,,1111 M I I III IMIIII III II III II llilll I 

Db 1042 TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 1101 

Qy 1168 AACACTAAGGAAGAAGTAATTGTGAGGAGTAAA GACAAAGAGGATTTAGTTTGTAGT 1224 

II III IMIIII MM MM I MM II Mill I llilll III I 

Db 1102 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1161 

Qy 1225 GCAGCCCTTCACAGTCCACAAGAATCACCT GTGGGTAAAGAAGAC 12 69 

MIMI I II III I MM III llilll II 

Db 1162 AAt^TCCTTCATAATOVACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1221 

'Qy 1270 AGAGTTGTGTCTC(^GAAAAGA(^TGGACATTTTTAATGAAATGCAGATGTCAGTAGTA 132 9 

IMMMMI IMIIII III MM II 1 1 1 i 1 1 1 J I I I MM I I 

Db 1222 GAAGTTGTGTCTTCAGAAAAAGCTUW^GACAGTTTTAATGAAAAGAGAGTTGCAGTGGAA 12 81 

Qy 1330 GCACCTGTGAGGGAAGAGTATGCAGACTTTAAGCCATTTGAACAAGC^TGGGAAGTGAAA 138 9 

II Ml IMIIII II MM I || I E 1 1 1 1 1 1 I II M : i I 

Db 1282 GCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTGAAA 1341 



Qy 



13 90 GATACTTATGAGGGAAGTAGGGATGTG CTGG CTG CTAGAG CT 



AATGTG 1437 



1 1 1 1 I III I Ml III II MINIM III I II II 

Db 1342 GATA GTAAGGAAGATAGTGATATGTTGGCTGCTGGAGGTAAAATCGAGAGCAACTTG 1398 

Qy 1438 GAAAGTAAAGTGGACAGAAAATGCTTGGAAGATAG C CTGGAG CAAAAAAGT CTTGGGAAG 14 97 

MINI INI I MUM II I 1 1 1 E 1 1 1 1 1 MM I II i II 

Db 13 99 GAAAGTAAAGTGGATAAAAAATGTTTTG CAGATAG CCTTGAG CAAACTAAT CACGAAAAA 1458 

Qy 14 98 GATAGTGAAGGCAGAAATGAGGATGCTTCTTTCC C CAGTACCC CAGAACCTGTGAAGGAC 1557 

Mill I I II MM! Ill III! il llllll llllll I I Mill 

Db 14 5 9 GATAGTGAGAGTAGTAATGATGATACTTCTTTCC C CAGTACG CCAGAAGGTATAAAGGAT 1518 

Qy 1558 AGCTCCAGAGCATATATTACCTGTGCTTCCTTTA CCTCAGCAACCGAAAGCACCACA 1614 

I II MINIUM II llllll MUM I Mill II MM II 

Db 1519 CGTTCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATTGCA 1578 

Qy 1615 GCAAACACTTTCCCTTTGTTAGAAGATCATACTTaVGAAAATAAAACAGATGAAAAAAAA 1674 

llllll III MIMIIMI Mill II MUM II M MINIMUM 

Db 1579 ACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAAAAA 163 8 

Qy 1675 ATAGAAGAAAGGAAGG CC CAAATTATAACAGAGAAG ACTAGCCCCAAAACGTCAAAT 1731 

IIMIIMM NIMMIMM MIINIMM llllll IMMM 1 1 1 1 1 

Db 163 9 ATAGAAGAAAAGAAGG CC CAAATAGTAACAGAGAAGAATACTAG CACCAAAACATCAAAC 1698 

Qy 1732 CCTTTCCTTGTAG(^GTACAGGATTCTGAGGQ\GATTATGTTACAACAGATACCTTATCA 17 91 

III 1 1 MM | | Mill Mill 1 1 II I III II IIMIIMM III M 

Db 1699 CCTTTTCTTGTAGCAGC^C^GGATTCTGAGACAGATTATGTCACAACAGATAATTTAACA 1758 

Qy 17 92 AAGGTGACTGAGGCAGCAGTGTCAAACATGCCTGAAGGTCTGACGCCAGATTTAGTTCAG 1851 

Mill Mill II III MMM II III INN 1 1 1 1 [ 1 1 M M III 

Db 175 9 AAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTACAG 1818 

Qy 1852 GAAGCATGTGAAAGTGAACTGAATGAAGCCA.CAGGTA 1911 

MIM MMM III III Ml II MMM IMMMM Ml II M 

Db 1819 GAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAACAAAA 1878 

Qy 1912 GTGGACTTGGTCCAAACATCAGAAGCTATACAAGAAT^ 1971 

NIIINNI IIIIIIINNII III I II 1 1 INN N II IIIIIIIINI 

Db 187 9 ATGGACTTGGTT(^\AACATCAGAAGTTATC 193 8 

Qy 1972 TGCCCATCATTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATG 2 031 

1 1 1 1 1 1 1 1 1 1 1 1 1 II I Mill Mill M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 IMMMM 

Db 1939 TGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTTATG 1998 

Qy 2 032 GAAGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTA 2 091 

IMIMIMM llllll MM II ! 1 1 1 1 1 1 1 1 1 M I MINIM I 

Db 1999 GAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGCTCA 2 058 

Qy 2 092 TCCCCACTGGAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCTTGAGCCTGAAAAC 2151 

II III I INI II II llllll llllll II INN I IIIIIIINNII 

Db 2 059 T CACCATTAGAAG CTTCTTCAGTTAATTATGAAAG CATAAAACATGAG CCTGAAAAC 2115 

Qy 2152 CCCCCACCATATGAAGAAGCCATGAATGTAGCACT AAAAGCTTTGGGAACAAAGGAA 22 08 

I I I II I I II I I II N I I II I II I I I I II I II I INN I INI I III III 

Db 2116 CCCCCACCATATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATCAGGAATAAAGGAA 2175 

Qy 22 09 GGAATAAAAGAGC CTGAAAGTTTTAATG CAG CTGTTCAGGAAACAGAAGCTC CTTATATA 2268 

I Ml 1 1 M 1 : 1 1 1 1 1 I II 1 1 1 1 1 1 1 1 MM Mill llllllll IIIIIMI 



Db 2176 GAAATTAAAGAGCCTGAAAATATTAATG CAG CT CTTCAAGAAACAGAAGCTCCTTATATA 2235 

Qy 2269 TCCATTGCGTGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGCCAAGTCCAGATTTC 2328 

II Mill i: i ii i i i n 1 1 1 1 in in 1 1 1 1 1 1 

Db 2236 TCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGATTTC 22 95 

Qy 232 9 T CTAATTATTCAGAAATAGCAAAATTCGAGAAGTCGGTG C CCGAACACG CTGAGCTAGTG 2388 

Ml MM 1 1 1 1 MINI Ml III Mill II II lllllll 

Db 22 96 TCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGCTAGTT 2355 

Qy 238 9 GAGGATTCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAA 244 8 

II 1 1 1 1 1 i 1 ! 1 1 1 1 1 1 1 1 . ; 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! I ! I || 1 1 1 1| 

Db 2356 GAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCTGAC 2415 
Qy 244 9 GTCCCACAAACACAAGAGGAGGCTGTGATGCTCATGAAGGAGAGTCTCACTGA A 2502 

II lllllll MUM II MIMIIIM MM II IMMIIMM I 

Db 2416 GTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACTTCA 2475 

Qy 2 503 GTGT CTGAGACAGTAG CCCAG CACAAAGAGGAGAGACTTAGTG CCTCACCTCAGGAG CTA 2562 

I I III I I II MM I Ml MM I II I III I 

Db 2476 TTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTG CTTTG CCACCTGAGGGA 2535 
Qy 2563 GGAAAGCCATATTTAGAGTCTTTTCAGCCCAATTTACATAGTACAAAAGATGC TGCA 2619 

llllli lllllll II Mil Ml II MM Ml IIIIIIMI I I 

Db 2536 GGAAAG C CATATTTGGAAT CTTTTAAGCT CAGTTTAGATAACACAAAAGATACCCTGTTA 25 95 

Qy 262 0 TCTAATGACATTCCAACATTGACCAAAAAGGAGAAAATTTCTTTGCAAATGGAAGAGTTT 2 67 9 

II MM II IIIMIMI IIIIIIIIIIIIIIII lllllll Mill III I 

Db 2596 CCTGATGAAGTTT CAACATTGAG CAAAAAGGAGAAAATTCCTTTG CAGATGGAGGAG CTC 2655 

Qy 268 0 AATACTGC7VATTTATTCAAATGATGACTTACTTTCTTCTAAGGAAGACAAAATAAAAGAA 273 9 

I IMIMI lilllll II IMMIIMM I MM MM 

Db 2 656 AGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATAAGAGAA 2715 

Qy 274 0 AGTGAAACATTTTCAGATTCATCTCCGATTGAGATAATAGATGAATTTCCCACGTTTGTC 2799 

I llllli I 1 . 1 1 . Mill II I. II M II II II 

Db 2 716 ACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACATTGATC 277 5 
Qy 2 800 AGTG CTAAAGATGATT C TCCTAAATTAGCCAAGGAGTACACTGATCTAGAAGTATCC 2856 

Ml MMI MUM I MIMMIMM III II MM MIMMIMI 

Db 2776 AGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAAGTATCC 2 835 

Qy 2857 GACAAAAGTGAAATTGCTAATATCCAAAGCGGGGCAGATTCATTGCCTTGCTTAGAATTG 2916 

II I M 1 1 1 1 II I II I II II I II II II I MIMMIMM lllllll 

Db 2 83 6 CACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACAGAATTG 2 8 95 
Qy 2 917 CCCTGTGACCTTTCTTTCAAGAATATATATCCTAAAGATGAAG TACATGTTTCA 2 97 0 

III llllllllllll Mill III I II MM Mill I I I Ml 

Db 2896 CCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGTTTGTCA 2955 

Qy 2 971 GATGAATTCTCCGAAAATAGGTCCAGTGTATCTAAGGCATCCATATCGCCTTCAAATGTC 3 03 0 

Mill II II Mill MM I III MM I III II 1 1 II MM 

Db 2 956 GATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCAGATGTT 3 015 

Qy 3 031 TCTGCTTTGGAACCTCAGACAGAAATGGGCAGCATAGTTAAATCCAAATCACTTACGAAA 3090 

MIMIIIM MM MM II I MIMMIMM Mill III MM 

Db 3 016 TCTGCTTTGGCCA.CTCAAGCA.GAGATAGAGAGCATAGTTAAACCCAAAGTTCTTGTGAAA 3075 



Qy 3 091 GAAGCAGAGAAAAAACTTCCTTCTGACACAGAGAAAGAGGACAGATCCCTGTCAGCTGTA 315 0 

I I I I I I I I I I I I I I I I I I I I I I II HIM I I I I I I I I I I I I I I I II IN II 

Db 3 076 GAAGCTGAGAAAAAACTT C CTTCCGATACAGAAAAAGAGGACAGAT CAC CATCTGCTATA 313 5 

Qy 3151 TTGTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAG 3210 

II I MIMI MIMI M MUM IMIM III IIMIII1I Mill 

Db 3136 TTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGACATTAAG 3195 

Qy 3211 AAGACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGC 327 0 

IIIIMMIMIIIIIIMIIIIIIII IIIIIIMIIIII II lllllll i 

Db 3196 AAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTATTCAGC 3255 

Qy 3271 ATTGTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGG 333 0 

Mill II Mill IM MMIMM MIMI MM Mill MM Mill 

Db 3256 ATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGCTTTAGG 3315 

Qy 3331 ATATATAAGGGCGTGAT CCAGG CTATC CAGAAATCAGATGAAGG CCACCCATTCAGGGCA 339 0 

I II 1 1 HIM MM MIMM MIMI IMMMil MIMI Mill 

Db 3316 ATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGGCA 3375 

Qy 33 91 TATTTAGAATCTGAAGTTGCTATATCAGAGGAATTGGTTCAGAAATACAGTAATTCTGCT 345 0 

III I Ml MllillMII Mill Mill II I lllllll IIIIIIMIIIII 

Db 3376 TATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAATTCTGCT 343 5 

Qy 3451 CTTGGTCATGTGAACAGCACAATAAAAGAACTGAGGCGGCTTTTCTTAGTTGATGATTTA 3510 

lllllllllllllll Mil Mill Mill Mill II I II II i II Mill III 

Db 343 6 CTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGATGATTTA 34 95 

Qy 3511 GTTGATTCCCTGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTC 357 0 

I I M i I I I I Mill I I II I M 

Db 34 96 GTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTT 3555 

Qy 3571 AATGGTCTGACACTACTGATTTTAGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTAT 3 63 0 

I i I I II I M Mill II Ml II I I M 

Db 3 55 6 AATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTTATTTAT 3615 

Qy 3631 GAACGGCATCAGGTGCAGATAGATCATTATCTAGGACTTGCAAACAAGAGTGTTAAGGAT 369 0 

1 1 - 1 1 1 1 1 ! 1 1 II: MIMI IIMM IMMIMI MM MIMI III 

Db 3616 GAACGG CATCAGG CACAGATAGAT CATTAT CTAGGACTTG CAAATAAGAATGTTAAAGAT 3675 

Qy 3691 GCCATGGCCAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCAGA 374 0 

II HIM IMIIMI MIMI MIMI II MIMI II 

Db 3676 GCTATGG CTAAAATC CAAG CAAAAAT CCCTGGATTGAAGCG CAAAG CTGA 3725 
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XX 
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CC with bone remodelling or osteoporosis. ABS70344 -ABS70512 represent 

CC human bone remodelling genes of the invention. 

XX 

SQ Sequence 4822 BP; 1441 A; 1046 C; 1073 G; 1247 T; 15 Other; 

Query Match 62.1%; Score 2323.8; DB 24; Length 4822; 

Best Local Similarity 80.9%; Pred. No. 0; 
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Qy 63 CGCGAAGGCAGCAGAAGCAGTCTCATTGTTCCGGGAGCCGTCGCCTCTGCAGGTTCTTCG 122 

I II 1 1 1 1 1 1 II ; 1 1 1 1 : I : Mill i 1 1 1 1 1 1 1 1 1 M I | 1 1 M i mi 

Db 78 CNCGGAGGCAGGAGGAGCAGTCTCATTGTTCCGGGAGCCGTCACCACAGTAGGTCCCTCG 137 

Qy 123 GCTCGGCTCGGC^CGACTCGGCCTGCCTGGCCCCTGCCAGTCTTGCCCAACCCCCACAAC 182 

1 1 1 1 I I Ml Mill Mill I IIIIIIIIIIIMM 

Db 13 8 GCTCAGT CGGCCC^GCCCCTCTCAGTCCTCCCC^ACCCCCACAAC 182 

Qy 183 CGCCCGCGACTCTGAGGAGAAGCGGC - CCTGCGGCGGCTGTAGCTGCAGCATCGTCGGCG 241 

1 1 1 1 1 1 1 1 MMIM I II I MM I I IMIIIIMIII M I 

Db 183 CGCCCGCGGCTCTGAGACGCGGCCCCGGNGGCGGCGGCAGCAGCTGCAGCATCATC - TCC 241 

Qy 242 ACCCGCCAGCCATGGAAGACATAGACCAGTCGTCGCTGGTCTCCTCGTCCACGGACAGCC 301 

MM IIIIIIIIIIIIMI I Illlllll I Mill MMIM 

Db 242 ACCCTCCAGCCATGGAAGACCTGGACCAGTCTCCTCTGGT CTCGTCCTCGGACAGCC 2 98 

Qy 3 02 CGCCCCGGCCTCCGCCCGCCTTCAAGTACCAGTTCGTGACGGAGCCCGAGGACGAGGAGG 361 

I MMMM I MIMI MM M MMMMM II M 1 1 1 II I II 1 1 II 1 1 1 

Db 299 CACCCCGGCCGCAGCCCGCGTTCAAGTACCAGTTCGTGAGGGAGCCCGAGGACGAGGAG- 357 



Qy 3 62 ACGAGGAGGAGGAGGAGGACGAGGAGGAGGACGACGAGGACCTAGAGGAACTGGAGGTGC 421 

II I I lllll II II llllllll II lllll lllll lllll llllllllll 

Db 358 - - GAAGAAGAGGANGATGAAGAGGAGGAGGAGGACGAAGACCTGGAGGAGCTGGAGGTGC 415 

Qy 422 TGGAGAGGAAGCCCGCAGCCGGGCTGTCCGCAGCTGCGGTGC CGCCCGCCGCCG 475 

Illlllllilllllll llllllllllllll II I IIII I II IIIIIM 

Db 416 TGGAGAGGAAGCCCGCCGCCGGGCTGTCCGCGGCCCCAGTGCCCACCGCCCCTGCCGCCG 475 

Qy 476 CCGCGCCGCTGCTGGACTTCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGC 535 

III I llllllll I I 1 1 1 1 MM! II llllllll II II IIII 

Db 476 GCGCGCCNNTAATGGACTTCGGAAATGACTTCGTGCCGCCGGCGCCCCGGGGACCCCTGC 535 

Qy 536 CGGCCGCGCCCCCTGCCGCTCCTGAGAGGCAGCCATCCTGGGAACGCAGCCCCG 589 

IIIIIII Mill I Ml II III MMMI II lllll I lllll I 

Db 536 CGGCCGCTCCGGCGGTCGCCCCGGAGCGGCAGCCGTCTTGGGACCCGAGCCCGGTGTCGT 5 95 

Qy 5 9 0 - - - CGGCGCCCGCGCCATCCCTGCCGCCCGCTGCCGCAGTGGTGCCCTCCAAGCTCCCAG 64 6 

I I MINIM II I I MINIMI MMM MMM I 

Db 596 CGACCGTGCCCGCGCCATCCCCGCTGTCTGCTGCCGCAGTCTCGCCCTCCAAGCTCCCTG 655 

Qy 647 AGGACGACGAGCCTCCGGCGAGGCCCCCGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGG 7 06 

IIII II II III I III II II IIII M lllll II II III III IIMMM II 

Db 656 AGGACGACGAGCCTCCGGCCCGGCCTCCCCCTCCTCCCCCGGCCAGCGTGAGCCCCCAGG 715 

Qy 707 CGGAGC CCGCCGCGCCCCCTTCCACGCCGGCCG 73 9 

I MM IMMMMIMI lllll IIIIIII 

Db 716 CAGAGCCCGTGTGGANCCCGCCAGCCCCGGCTNCCGCCGCGCCCCCCTCCACCCCGGCCG 775 

Qy 74 0 CGCCCAAGCGCAGGGGCTCC - - -GGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTG 7 96 

IIIIIIIMMIIIIMIM IIIIIII MM II III III Mill II IMMMMM 

Db 776 CGCCCAAGCGCAGGGGCTCCTCGGGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTG 835 

Qy 7 97 CATCTGAGCCTGTGATACCCTCCTCTGCAGAAAAAATTATGGATTTGATGGAGCAGCCAG 8 56 

IMMMIIMMIMM MMMMMIMM MMM IIII IIIIIIIIIII 

Db 836 CATCTGAGCCTGTGATACGCTCCTCTGCAGAAAA TATGGACTTGAAGGAGCAGCCAG 8 92 

Qy 857 GTAACACTGTTTCGTCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCCT 916 

IIMMM lllll I MMMNMMM MMM II MMM Mill I 

Db 8 93 GTAACACTATTTCGGCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCTT 952 

Qy 917 CTCTTCCTTCTCTATCTCCTCTCTCAACTGTTTCTTTTAAAGAACATGGATACCTTGGTA 976 

IMIMMMIM IMMMIMM I I MMM llllllllll IIIIIIIIIII 

Db 953 CTCTTCCTTCTCTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTA 1012 

Qy 977 ACTTATCAGCAGTGTCATCCTCAGAAGGAACAATTGAAGAAACTTTAAATGAAGCTTCTA 1036 

i ii iii iiii Mill iiiiiiiii ii Mini i i i iiiiiiiiiii 

Db 1013 ATTTGTCAAC^GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTA 1072 

Qy 1037 AAGAGTTGCCAGAGAGGGCAACAAATCCATTTGTAAATAGAGATTTAGCAGAATTTTCAG 1096 

lllll I MMM lllll II II I I II IMMMMM IIII MMMI 

Db 1073 AAGAGGTCTCAGAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAG 1132 

Qy 1097 AATTAGAATATTCAGAAATGGGATCATCTTTTAAAGGCTCCCCAAAAGGAGAGTCAGCCA 1156 

IIMIIIIII MMMIMIMIMM II I I III MMMI III II III 

Db 1133 AATTAGAATACTCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCG 1192 



Qy 



1157 TATTAGTAGAAAACACTAAGGAAGAAGTAATTGTGAGGAGTAAA GACAAAGAGGATT 1213 



II Mill Ml Ml III I 1 1 II 1 1 1 1 I 1 1 1 1 II Mill I I 

Db 1193 TAATAGTAGCAAATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGT 12 52 
Qy 1214 TAGTTTGTAGTGCAGCCCTTCACAGTCCACAAGAATCACCT GTGG 1258 

Mill III I llllll I II llllll I MM III 

Db 1253 TAGTTAGTAATAACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGG 1312 

Qy 1259 GTAAAGAAGACAGAGTTGTGTCTCCAGAAAAGACAATGGACATTTTTAATGAAATGCAGA 1318 

llllll II IMMIIMI I III Ml MM MIMIIMM I 

Db 1313 TTAAAGAGGATGAAGTTGTGT CTTCAGAAAAAG CAAAAGACAGTTTTAATGAAAAGAGAG 1372 

Qy 1319 TGT CAGTAGTAG CAC CTGTGAGGGAAGAGTATG CAGACTTTAAG CCATTTGAACAAG CAT 1378 

I MM I III III IMIMI II MIMIIMM II MUM I M M 

Db 1373 TTG CAGTGGAAG CT C CTATGAGGGAGGAATATGCAGACTT CAAACCATTTGAG CGAGTAT 1432 
Qy 1379 GGGAAGTGAAAGATACTTATGAGGGAAGTAGGGATGTGCTGGCTGCTAGAGCT 1431 

I I I ' 1 1 1 1 1 1 1 1 I III I Ml III II Ml M Ml I 

Db 1433. GGGAAGTGAAAGATA GTAAGGAAGATAGTGATATGTTGGCTGCTGGAGGTAAAATCG 14 8 9 

Qy 1432 AATGTGGAAAGTAAAGTGGACAGAAAATG CTTGGAAGATAG CCTGGAG CAAAAAA 14 86 

II MIIMMMMIMI I llllll M I MMIIMI IMIMI I 

Db 14 90 AGAGCAACTTGGAAAGTAAAGTGGATAAAAAATGTTTTGCAGATAGCCTTGAGCAAACTA 154 9 

Qy 14 87 GTCTTGGGAAGGATAGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAAC 154 6 

M I II 1 1 1 1 1 1 1 1 I II Mill III MUM MUM I llllll 

Db 155 0 AT CACGAAAAAGATAGTGAGAGTAGTAATGATGATACTT CTTT CCCCAGTACG CCAGAAG 1609 
Qy 1547 CTGTGAAGGACAGCTCCAGAGCATATATTACCTGTGCTTCCTTTA CCTCAGCAACCG 1603 

I I Mill I II IMMIIMI II llllll llllll I IMIMI I 

Db 1610 GTATAAAGGATCGTTCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTG 1669 

Qy " 1604 AAAGCACCACAGCAAACACTTTCCCTTTGTTAGAAGATCATACTT 1663 

I MM II llllll Ml IIIMIMM Mill II MIMI II II I 

Db 1670 AGAGCATTGCAACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCG 172 9 
Qy 1664 ATG - AAAAAAAAATAGAAGAAAGGAAGGCCCAAATTATAACAGAGAAG ACTAGCCCC 1719 

III IIIIIIIIIIIMIIIM II MIMI 1 1 : 1 1 1 : 1 1 1 1 llllll II 

Db 1730 ATGAAAAAAAAAATAGAAGAAAAGAAGGCCCAAATAGTAACAGAGAAGAATACTAG 178 9 

Qy 172 0 AAAACGTCAAATCC-TTTCCTTGTAGCAGTACAGGATTCTGAGGCAGATTATGTTACAAC 1778 

Mill Mill II III Mill M Mi II i 1 1 1 1 1 MIMI M Mill 

Db 17 90 AAAACATCAAACCCTTTTACTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAAC 184 9 

Qy 1779 AGATACCTTATCAAAGGTGACTGAGGCAGCAGTGTCAAACATGCCTGAAGGTCTGACGCC 1838 

Mill Ml III MIIIMM II Ml IMIMI MMM: Mill II 

Db 1850 AGATAATTTAACAAAGGTGACTGAGGAAGT CGTGGCAAACATG C CTGAAGG CCTGACTCC 1909 

Qy 1839 AGATTTAGTT CAGGAAG CATGTGAAAGTGAACTGAATGAAG CCACAGGTACAAAGATTGC 18 98 

MMIIMI IMMMIIMMIMIMM MIIIMM II 1 1 1 1 1 1 1 1 1 1 1 f I ! 

Db 1910 AGATTTAGTACAGGAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGC 1969 

Qy 18 99 TTATGAAACAAAAGTGGACTTGGTCCAAACATCAGAAGCTATACAAGAATCACTTTACCC 1958 

IIIMIMM IMMIIMI MMM MM III Mill Mill II II 

Db 1970 TTATGAAAC^AAAATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCC 202 9 

Qy 1959 CAC^VGCACAGCTTTGCCCATCATTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCC 2018 

IMM MMM llllll I II I Mill Mill III MMM M 



Db 2030 TGCAGCACAGCTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCC 208 9 

Qy 2 019 TGATATTGTTATGGAAGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGT 2078 

III : : I I II IIIIII Mil II I M M I 

Db 2 090 TGACATTGTTATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGAT 214 9 

Qy 2079 GCAGCCCAGTGTATCCCCACTGGAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCT 213 8 

II: III III I MM II II II llllll || Mill | 

Db 2150 ACAGCCCAGCTCATCACCATTAGAAG CTTCTTCAGTTAATTATGAAAGCATAAAACA 2206 

Qy 2139 TGAGCCTGAAAACCCCCCACCATATGAAGAAGCCATGAATGTAGCACT AAAAGCTTT 2195 

IMIIIMIIIII IIMI IIIMi Mill Mill MM MM Mill I 

Db 2207 TGAGCCTGAAAACCCCCCACCATATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATC 2266 

Qy 2196 GGGAACAAAGGAAGGAATAAAAGAGCCTGAAAGTTTTAATGCAGCTGTTCAGGAAACAGA 2255 

MM IIIMIII III Ml I Illllllllll MM llllllll 

Db 22 67 AGGAATAAAGGAAGAAATTAAAGAGCCTGAAAATATTAATGCAGCTCTTCAAGAAACAGA 232 6 

Qy 22 56 AGCTCCTTATATATCCATTGCGTGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGCC 2315 

I I I I I Mill I I. ;l II I 'I I || MM II 

Db 2327 AG CT CCTTATATATCTATTGCATGTGATTTAATTAAAGAAACAAAG CTTT CTG CTGAAC C 23 86 

Qy 2316 AAGTCCAGATTTCTCTAATTATTCAGAAATAGCAAAATTCGAGAAGTCGGTGCCCGAACA 23 75 

I III MINIMI MMMMIMM IIIIII III III IIMI II II 

Db 238 7 AGCTCCGGATTTCTCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCA 244 6 

Qy 237 6 CGCTGAGCTAGTGGAGGATTCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATGA 24 3 5 

MMIIIIII II II lllllllllllllllllllllllllllll 

Db 2447 TTCTGAGCTAGTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGA 2506 

Qy 2436 TTCGATTCCTGAAGTCCCACAAACACAAGAGGAGGCTGTGATGCTCATGAAGGAGAGTCT 2495 

Ml II Mill II IIIIII IIIIII II MMIIIIII MM II Mill 

Db 2507 TTCAATACCTGACGTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCT 2 566 

Qy 24 96 CACTGA AGTGT CTGAGACAGTAG C CCAG CACAAAGAGGAGAGACTTAGTG CCTC 254 9 

IIIIII- I I llllllll I I I I I Ml Mill I 

Db 2 567 CACTGAGACTTCATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTT 2626 

Qy 2550 ACCTCAGGAGCTAGGAAAGCCATATTTAGAGTCTTTTCAGCCCAATTTACATAGTACAAA 2 60 9 

II I Ml IIIMIMMIMM II MMM III II MM III Mill 

Db 2 627 GCCACCTGAGGGAGGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAA 268 6 
Qy 2 610 AGATGC TGCATCTAATGACATTCCAACATTGACCAAAAAGGAGAAAATTTCTTTGCA 2666 

MM I I II MM II : II IIMI II 1 1 1 1 1 

Db 2687 AGATACCCTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCA 274 6 

Qy 2667 AATGGAAGAGTTTAATACTGCAATTTATTCAAATGATGACTTACTTTCTTCTAAGGAAGA 272 6 

Mill III I I II I III I I II III Mill I llllllll II Mill II I Ml 

Db 2747 GATGGAGGAG CTCAGTACTG CAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGC 28 06 

Qy 2727 CAAAATAAAAGAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATAATAGATGAATT 278 6 

I MM Mill IIIMI 1 1 1 ! I ! i 1 1 1 1 [ 1 1 1 1 1 Mill || 1 1 1 1 f 1 1 1 II 

Db 2807 ACAGATAAGAGAAACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTT 2866 
Qy 2787 TCCCACGTTTGTCAGTGCTAAAGATGATTC TCCTAAATTAGCCAAGGAGTACACTGA 2843 

II II II Mill Mill IIIIII I llllllllllll Ml II Mill 

Db 2 867 CCCTACATTGATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGA 2926 



Qy 2844 TCTAGAAGTATCCGAC^VAAAGTGAAATTGCTAATATCC^AAGCGGGGCAGATTC^TTGCC 2903 

II 1 Ml I I Ml II || || I llllllll 

Db 2927 CCTAGAAGTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCC 2 98 6 
Qy 2904 TTG CTTAGAATTG C C CTGTGACCTTTCTTTCAAGAATATATAT C CTAAAGATGAAG 295 9 

1 1 II I ! 1 1 1 1 1 1 ; I llllllllllll MMI III I 1 1 MM Mill 

Db 2 987 TTGCACAGAATTGCCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAA 304 6 

Qy 2 9 6 0 - - TACATGTTTCAGATGAATTCTCCGAAAATAGGTCCAGTGTATCTAAGGCATCCATATC 3017 

I I I llllllll II II Mill 1 1 1 1 I Ml MM I III 

Db 3047 AAT CAGTTT CT CAGATGA CTTTT CTAAAAATGGGT CTG CTACAT CAAAGGTG CT CTTATT 3106 

Qy 3 018 GCCTTCAAATGTCTCTGCTTTGGAACCTCAGAC^ 307 7 

MM M MM IMIIIIIII MM MM II I MMIMIMM MM 

Db 3107 GCCTCCAGATGTTTCTGCTTTGGCC^CTCAAGCAGAGATAGAGAGCATAGTTAAACCCAA 3166 

Qy 3 078 ATt^CTTACGAAAGAAGCTVGAGAAAAAACTTCCTTCTGACACAGAGAAAGAGGACAGATC 313 7 

I III 1 1 1 1 1 1 1 1 1 MIIIIIMIIIIIMI II Mill 1 1 1 1 1 1 1 ! I ! 1 1 1 

Db 3167 AGTTCTTGTGAAAGAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATC 322 6 

Qy 3138 CCTGTCAGCTGTATTGTCAGCAGAGCTGAG - TAAAACTTCAGTTGTTGACCTCCTCTACT 3196 

I II Ml MM IMMMMIMM 1 1 1 1 M M M 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 MM 

Db 3227 ACCATCTGCTATATTTTCAGCAGAGCTGAGCTAAAACTTCAGTTGTTGACCTCCTGTACT 328 6 

Qy 3197 GGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTC 3256 

I llllll :IM!I MMII II MM I Mill Ml II III II 

Db 3287 GGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCAT 334 6 

Qy 3257 TGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGA 3316 

1 1 1 1 1 1 1 MIIMIMM M Mill IMIIIIIIIIIIIIMMIIIIIII MM 

Db 334 7 TGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGA 34 0 6 

Qy 3317 CTATCAG CTTTAGGATATATAAGGGCGTGATCCAGG CTATC CAGAAAT CAGATGAAGG CC 3376 

I IMMMMIMM MMI llllllll 1 1 1 M 1 1 II I II 1 1 M M 1 1 M 1 1 1 

Db 34 07 CCATCAG CTTTAGGATATACAAGGGTGTGATCCAAG CTAT C CAGAAAT CAGATGAAGG CC 34 66 

Qy 3377 AC CCATT CAGGG CATATTTAGAAT CTGAAGTTG CTATATCAGAGGAATTGGTT CAGAAAT 343 6 

1 1 1 1 1 1 1 1 i 1 1 1 II I i ! I IIIIIIIMIIII Mill Mill IMMIIMM I 

Db 34 67 ACCCATTCAGGGCATATCTGGAAT CTGAAGTTG CTATATCTGAGGAGTTGGTTCAGAAGT 3 52 6 

Qy 3437 ACAGTAATTCTGCTCTTGGTCATGTGAACAGC^ 34 96 

llfllll III lllllllllll llllll MM Mill Mill Mill II MM 

Db 3527 ACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCT 3586 

Qy 34 97 TAGTTGATGATTTAGTTGATTCCCTGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATG 3556 

MMIMMIMMIMIMM IIIIMIII! Mill MM 

Db 3587 TAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATG 364 6 

Qy 3557 TTGGTGCCTTGTTCAATGGTCTGACACTACTGATTTTAGCTCTGATCTCACTCTTCAGTA 3616 

IIIIMIIIIII II ;l !!! 1 1 II 1 1 1 1 1 III 1 1 1 Mill II llllllllllll 

Db 3647 TTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTG 3706 

Qy 3617 TTCCTGTTATTTATGAACGGCATCAGGTG(^GATAGATOVTTATCTAGGACTTGCAAACA 3676 

II 1 1 1 II II II II M II M II II I II I II I II II M I M 1 1 1 II 1 1 II I M M M I I 

Db 3707 TTCCTGTTATTTATGAACGGCATCAGGCACAGATAGATCATTATCTAGGACTTGCAAATA 3766 



Qy 



3677 



AGAGTGTTAAGGATG CCATGG CCAAAAT CCAAG CAAAAATCC CTGGATTGAAG CG CAAAG 3736 



Db 



3767 




Qy 



3737 



CAGA 3740 



Db 



3827 



CTGA 3830 



RESULT 7 
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PT New purified disease detection and treatment molecule proteins and 

PT polynucleotides, useful for diagnosing, treating or preventing cancers 

PT (e.g. leukemia or sarcoma), anemia, Crohn's disease, AIDS, osteoporosis 

PT or hepatitis 

XX 

PS Claim 1; SEQ ID NO 124; 339pp + Sequence Listing; English. 
XX 

CC This invention describes a novel disease detection and treatment molecule 

CC polypeptide (MDDT) which has ant i- inflammatory, immunosuppressive, 

CC osteopathic, cytostatic, anti-HIV, haemostatic, nephrotropic , 

CC antianaemic, antipsoriatic and hepatotropic activity. The polynucleotides 

CC and the polypeptides of the invention can be used for gene therapy, 

CC protein replacement therapy and are useful for treating a variety of 

CC diseases or conditions. These polypeptides or polynucleotides are 

CC particularly useful for diagnosing, treating or preventing cell 

CC proliferative disorders (e.g. cancers including adenocarcinoma, 

CC leukaemia, lymphoma, melanoma, myeloma or sarcoma), anaemia, Crohn's 

CC disease, acquired immunodeficiency syndrome (AIDS), Goodpasture's 

CC syndromes, inflammation, osteoporosis, thrombocytopaenia, psoriasis or 

CC hepatitis. ABX34440-ABX34835 encode the MDDT polypeptides represented in 

CC ABU11450-ABU11845, described in the disclosure of the invention. 

CC NOTE: The sequence data for this patent did not form part of the printed 

CC specification, but was obtained in electronic format from WIPO at 

CC ftp.wipo. int/pub/published_pct_sequences . 

XX 

SQ Sequence 4698 BP; 1410 A; 1028 C; 1022 G; 1238 T; 0 other; 

Query Match 61.4%; Score 2297.4; DB 25; Length 4698 ; 

Best Local Similarity 80.7%; Pred. No. 0; 

Matches 2996; Conservative 0; Mismatches 596; Indels 121; Gaps 22; 
Qy 134 CACGACTCGGCCTGCCTGGCCCCTGCCAGTCTTGCC(^CCCCaVCAACCGCCCGCGACT 193 

I II III I I III 1 1 1 1 1 1 lllll I I I : I ! 1 1 1 1 1 1 1 I 1 : 1 1 1 ! 

Db 23 CTCGGCTC^GTCGGCCCAGCCCCTCT(^GTCCTCCC(^CCCCCAau\CCGCCCGCGCTC 82 

Qy 194 CTGAGGAGAAGCGGCCCTGCGGCGGCTGTAGCTGCAGCATCGTCGGCGACCCGCCAGCCA 253 

Mill I I I llllllll I INN INI II I MM Illllll 

Db 83 CTGAGACGCGCCCCGGCGGCGGCGGC^GCAGCTG(^GC^TCATC-TCCACCCTCCAGCCA 141 

Qy 254 TGGAAGACATAGACCAGTCGTCGCTGGTCTCCTCGTCCACGGACAGCCCGCCCCGGCCTC 313 

MINIM I II II M II I lllll MMIM IMMIMM llllllll I 

Db 14 2 TGGAAGACCTGGACCAGTCTCCTCTGGT CTCGTCCTCGGACAGCCCACCCCGGCCGC 198 

Qy 314 CG CCCGC CTTCAAGTACCAGTT CGTGACGGAGC CCGAGGACGAGGAGGACGAGGAGGAGG 373 

MMM MINIMI MINI I M I M 1 1 1 II II II II 1 1 II II MM 

Db 199 AG CCCG CGTTCAAGTACCAGTT CGTGAGGGAG C CCGAGGACGAGGAG GAAGAAGAGG 255 

Qy 374 AGGAGGACGAGGAGGAGGACGACGAGGAC CTAGAGGAACTGGAGGTG CTGGAGAGGAAG C 433 

Illllll llllllll II lllll lllll lllll I M MMM MINI M 

Db 256 AGGAGGAAGAGGAGGACGAGGACGAAGACCTGGAGGAGCTGGAGGTGCTGGAGAGGAAGC 315 
Qy 434 CCGCAGCCGGGCTGTCCGCAGCTGCGGTGC CGCCCGCCGCCGCCGCGCCGCTGC 4 87 

MM Illllll Illllll II Mill I II MMIM MMM Ml 

Db 316 CCGCCGCCGGGCTGTCCGCGGCCCCAGTGCCCACCGCCCCTGCCGCCGGCGCGCCCCTGA 375 

Qy 48 8 TGGACTTCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGCCGGCCGCGCCCC 54 7 

llllllll I I MM lllll II II MM II II IMMMIMI III 



Db 



376 TGGACTTCGGAAATGACTTCGTGCCGCCGGCGCCCCGGGGACCCCTGCCGGCCGCTTCCC 435 



Qy 


548 


CTGCCGCTCCTGAGAGGCAGCCATCCTGGGAACGCAGCCCCG CGGCGCCCG 

i i mi ii mi i i i i i i i ii i i i i i i t i i i i i i i i i i i i 

1 1 1 II 1 II 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CCGTCGCCCCGGAGCGGCAGCCGTCTTGGGACCCGAGCCCGGTGTCGTCGACCGTGCCCG 


598 


Db 


436 


495 


Qy 


599 


CGCCATCCCTGCCGCCCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAGAGGACGACGAGC 
I I i I i i i i i ii i t i i i i i i j i i i i i i i i i i i i i \ i i t i i i i i i i i i i i i i i i i 

1 1 1 1 1 1 1 1 1 II 1 1 II II M M II M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CGCCATCCCCGCTGTCTGCTGCCGCAGTCTCGCCCTCCAAGCTCCCTGAGGACGACGAGC 


658 


Db 


496 


555 


Qy 


659 


CTCCGGCGAGGCCCCCGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGGCGGAG 
i i i i i i i tiii ii i i i i i it ii mi mi i i i i i i i i iii iii 
1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II II III III 1 1 1 1 1 1 1 1 III III 

CTCCGGCCCGGCCTCCCCCTCCTCCCCCGGCCAGCGTGAGCCCCCAGGCAGAGCCCGTGT 


711 


Db 


556 


615 


Qy 


712 


CCCGCCGCGCCCCCTTCCACGCCGGCCGCGCCCAAGCGCA 

i i i i i i t i i i t i i i i i i i i i i i i i i i i i i i i i i i i i i t 
1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 

GGACCCCGCCAGCCCCGGCTCCCGCCGCGCCCCCCTCCACCCCGGCCGCGCCCAAGCGCA 


751 


Db 


616 


675 


Qy 


752 


GGGGCTCC GGCTCAGTGGATGAGACCC - - TTTTTGCTCTTCCTGCTGCATCTGAGC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 i 


805 


Db 


676 


1 1 1 M 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GGGGCTCCTCGGGCTCAGATGGATGAGACCCATTTTTGCTCTTACCTGCTGCATCTGAGC 


735 


Qy 


806 


CTGTGATACCCTCCTCTGCAGAAAAAATTATGGATTTGATGGAGCAGCCAGGTAACACTG 
i i i i i i i i i i i i i i i i i iti i i i i i i tiii i i i i i i i i i i i i i i i i i i i 


865 


Db 


736 


1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 

CTGTGATACGCTCCTC - - ATG CAGAAAATATGGACTTGAAGGAG CAG C CAGGTAACACTA 


793 


Qy 


866 


TTTCGTCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCCTCTCTTCCTT 
i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i i t i i i i i i i t i i i i i i i i i i i i i i i i i i 
1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 
TTTCGGCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCTTCTCTTCCTT 


925 


Db 


794 


853 


Qy 


926 


CTCTATCTCCTCTCTCAACTGTTTCTTTTAAAGAACATGGATACCTTGGTAACTTATCAG 
Mil 1 1 1 1 i 1 i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i ii iii 

MM 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II III 

CTCTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAA 


985 


Db 


854 


913 


Qy 


986 


CAGTGTCATCCTCAGAAGGAACAATTGAAG-AAACTTTAAATGAAGCTTCTAAAGAGTTG 

1 1 1 1 1 1 II 1 1 1 1 1 1 1 i 1 1 11 ill ill t I i I 1 I I I I i i i i i i r i i i i 

1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II Ml III 1 1 1 M 1 1 1 II 1 1 1 1 1 II 1 1 1 

CAGTATTACCCACTGAAGGAACACTTCAAGAAAAATGTCAGTGAAGCTTCTAAAGAGGTC 


1044 


Db 


914 


973 


Qy 


1045 


CCAGAGAGGGCAACAAATCCATTTGTAAATAGAGATTTAGCAGAATTTTCAGAATTAGAA 
i i i i i i i i i i i ii ii i i ii i i t i i i t i i i i i i i i i i i t i i i i i I i i i i i 

II 1 1 1 1 II 1 1 1 II II 1 1 II 1 1 M 1 II 1 1 II 1 1 II II 1 II 1 II 1 1 M 1 II 

T CAGAGAAGG CAAAAACT CTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAA 


1104 


Db 


974 


1033 


Qy 


1105 


TATTCAGAAATGGGATCATCTTTTAAAGGCTCCCCAAAAGGAGAGTCAGCCATATTAGTA 
ii i i i i i i i i i i i i i i i i i ii i i iii i i i t i i i iii t i iii ii i i i i i 

II 1 1 1 1 M 1 1 II 1 II 1 1 1 1 II 1 1 III 1 1 II 1 1 1 III II III II II 1 II 

TACTCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTA 


1164 


Db 


1034 


1093 


Qy 


1165 


GAAAACACTAAGGAAGAAGTAATTGTGAGGAGTAAA GACAAAGAGGATTTAGTTTGT 


1221 


Db 


1094 


1 III III 1 INI Mil I MM M Mill 1 MUM II 

GCAAATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGT 


1153 


Qy 


1222 


AGTGCAGCCCTTCACAGTCCACAAGAATCACCT GTGGGTAAAGAA 

II MM II 1 II II MM 1 MM Ml MM II 

AATAACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAG 


1266 


Db 


1154 


1213 


Qy 


1267 


GACAGAGTTGTGTCTCCAGAAAAGACAATGGACATTTTTAATGAAATGCAGATGTCAGTA 

II MIIIIMM IIIMM III MM Mill 1 1 MM 

GATGAAGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTG 


1326 


Db 


1214 


1273 



Qy 1327 GTAG CAC CTGTGAGGGAAGAGTATG CAGACTTTAAGCCATTTGAACAAG CATGGGAAGTG 1386 

I III III lllllll II lllllllllll II llllllll I II lllllllill 

Db 12 74 GAAGCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 1333 

Qy 1387 AAAGATACTTATGAGGGAAGTAGGGATGTG CTGG CTG CTAGAGCT AAT 1434 

lllllll I III I III III II llllllll III I I! 

Db 1334 AAAGATA- - -GTAAGGAAGATAGTGATATGTTGGCTGCTGGAGGTAi(4$$([:$4$4GC^$5 ffiXSO 

Qy 1435 GTGGAAAGTAAAGTGGACAGAAAATGCTTGGAAGATAGCCTGGAGCAAAAAAGTCTTGGG 14 94 

1 1 ' ! 1 1 1 M 1 1 1 1 1 1 I MINI II I MINIMI I MM I II I 

Db 13 91 TTGGAAAGTAAAGTGGATAAAAAATGTTTTGCAGATAGCCTTGAGCAAACTAATCACGAA 14 50 

Qy 14 95 AAGGATAGTGAAGG CAGAAATGAGGATGCTTCTTT CC CCAGTACCCCAGAAC CTGTGAAG 1554 

II llllllll I II Mill III MIMMIMIMIM MUM I I III 

Db 14 51 AAAGATAGTGAGAGTAGTAATGATGATACTTCTTTCCCCAGTACGCCAGAAGGTATAAAG 1510 
Qy 1555 GACAGCTCCAGAGCATATATTACCTGTGCTTCCTTTA CCTCAGCAACCGAAAGCACC 1611 

II I II lllllllill II MUM MM I MIMM M MM 

Db 1511 GATCGTTCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATT 1570 

Qy 1612 AC^GCAAAC^CTTTCCCTTTGTTAGAAGAT 1671 

II MUM III MM MM Mill IIIIIIIIIIIMII II IMIMMI 

Db 1571 G CAACAAACATTTTTCCTTTGTTAGGAGAT C CTACTTCAGAAAATAAGACCGATGAAAAA 163 0 

Qy 1672 AAAATAGAAGAAAGGAAGG C CCAAATTATAACAGAGAAG ACTAG CCC CAAAACGTCA 1728 

Hill; !l II IMIIMIMM lllllllllll MINI lllllll III 

Db 1631 AAAATAGAAGAAAAGAAGG C C CAAATAGTAACAGAGAAGAATACTAG CAC CAAAACATCA 1690 

Qy 172 9 AAT C CTTTC CTTGTAG CAGTA CAGGATT CTGAGG CAGATTATGTTA CAACAGATA C CTTA 178 8 

II Mill lllllllill IIIIIIIIIIMI I 

Db 16 91 AACCCTTTTCTTGTAGCAGCACAGGATTCTGAGAC^GATTATGTC^C^C^GATAATTTA 1750 

Qy 1789 TCAAAGGTGACTGAGGCAGCAGTGTCAAAC^ 184 8 

Mill MINI MM II III llllllllllllllll Mill MIMM MM 

Db 1751 ACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTA 1810 

Qy 184 9 (^GGAAG(^TGTGAAAGTGAACTGAATGAAGCCACAGGTACAAAGATTGCTTATGAAA(^ 1908 

IMIMMI II MIMM MM I 1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 M ! 1 1 1 1 1 1 

Db 1811 CAGGAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAACA 1870 

Qy 19 09 AAAGTGGACTTGGTCCAAACATCAGAAGCTATACAAGAATC^ 1968 

III MIMM MINIMI Ml Mill IMM II II I MM 

Db 18 71 AAAATGGACTTGGTTCAAACATCAGAAGTTATC 1930 

Qy 1969 CTTTGCCCATCATTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTT 2 028 

MIM IMIII|4$|s 1 Mill Mill I ! I ! 1 1 1 1 ! I 1 1 1 1 1 MUM 

Db 1931 CTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTT 1990 

Qy 2 029 ATGGAAGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGT 2 088 

IMIMMI MUM MM II lllllllllll II I llllllll 

Db 19 91 ATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGC 2 05 0 

Qy 2 089 GTATCCCCACTGGAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCTTGAGCCTGAA 214 8 

III III I MM I! II MINI MM II Mill I lllllllill 

Db 2 051 TCATCACCATTAGAAG CTTCTTCAGTTAATTATGAAAG CATAAAACATGAG C CTGAA 2107 



Qy 214 9 AACCCCCCACCATATGAAGAAGCCATGAATGTAGCACT AAAAGCTTTGGGAACAAAG 2205 

I h I I I I M I I ! I I M I I I lllllll I I I I I I I I Mill I I I I I I I I I 
Db 2108 AAC CCCCCACCATATGAAGAGGC CATGAGTGTATCACTAAAAAAAGTAT CAGGAATAAAG 2167 

Qy 22 06 GAAGGAATAAAAGAGC CTGAAAGTTTTAATG CAG CTGTTCAGGAAACAGAAG CTCCTTAT 2265 

I I I I III IIIIIIIIIIMI I IIIIIIIIMI I I I I 

Db 2168 GAAGAAATTAAAGAGC CTGAAAATATTAATG CAGCTCTTCAAGAAACAGAAG CTCCTTAT 2227 

Qy 2266 ATATCC^TTGCGTGTGATTTAATTAAAGAAAOW^GCTCTCaVCTGAGCCAAGTCCAGAT 2325 

Mill Mill M 1 1 M 1 1 It 1 1 1 M I M It 1 1 i 1 1 1 II MM Ml III III 

Db 2228 ATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGAT 2287 

Qy 232 6 TTCTCTAATTATT CAGAAATAGCAAAATTCGAGAAGTCGGTG CCCGAACACG CTGAG CTA 2385 

MIMI IMIMMMIM MUM III III MM II II llllllll 

2 Db 2288 ATTCA.GAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTC^QAGCTAAg^i^ 

Qy 23 86 GTGGAGGATTCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCT 2445 

II II IIIIIIIIIMIII MM! M MMM MMM M MM; II III 

Db 234 8 GTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCT 24 07 
Qy 244 6 GAAGTCCCACAAACACAAGAGGAGGCTGTGM 25 01 

II II MMM MMM II MM MM MM II M MMM 

Db 24 08 GACGTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACT 2467 

Qy 2 5 0 2 - - AGTGTCTGAGACAGTAGCC(^G(^(^WVGAGGAGAGACTTAGTGCCTC^CCTC^VGGAG 255 9 

II I III I Ml MM I III Mill I II I III 

Db 24 68 TCATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTTGCCACCTGAG 2527 

Qy 2560 CTAGGAAAG C CATATTTAGAGTCTTTT CAG CCCAATTTACATAGTACAAAAGATG C - - -T 2616 

Ml MMM M II MMM II! II III 

Db 2 528 GGAGGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAAAGATACCCTG 2587 

Qy 2 617 GCATCTAATGACATTCCAACATTGACCAAAAAGGAGAAAATTTCTTTGC^^ 2676 

I II 1 1 1 1 II lllllllll llllllllllllllll lllllll Mill III 

Db 258 8 TTACCTGATGAAGTTTCAACATTGAG CAAAAAGGAGAAAATTCCTTTG CAGATGGAGGAG 2647 

Qy 2677 TTTAATACTGCAATTTATTCAAATGATGACTTACTTTCTTCTAAGGAAGACAAAATAAAA 2736 

I I lllllll MIMIIMIIIMIIMM M 1 1 1 1 1 I ! I MM I 

Db 2648 CTCAGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATAAGA 2707 

Qy 2737 GAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATAATAGATGAATTTCCCACGTTT 2796 

MM MMM MMMl MMM Mill II llllllll II M II II 

Db 2708 GAAACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACATTG 2767 
Qy. 27 97 GT CAGTG CTAAAGATGATT C TC CTAAATTAG C CAAGGAGTACACTGATCTAGAAGTA 2853 

Mill Mill MMM I ! 1 1 1 1 1 1 1 1 1 1 1 III II Mill lllllllll 

Db 27 68 ATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAAGTA 2827 

Qy 2 854 TCCGAt^VAAAGTGAAATTGCTAATATCCAAAGCGGGGC^GATTCATTGCCTTGCTTAGAA 2 913 

Ml IIIIIIIIIMIIIIIMII II II M I IMMMMMI MM 

Db 282 8 TCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACAGAA 2887 
Qy 2 914 TTGCCCTGTGACCTTTCTTTCAAGAATATATATCCTAAAGATGAAG TACATGTT 2 967 

MMM 1 1 1 1 1 1 1 1 1 1 1 Mill III I II MM Mill I I I 

Db 2 888 TTGCCC(^TGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGTTTC 2 947 

Qy 2 968 TCAGATGAATTCTCCGAAAATAGGTCCAGTGTATCTAAGGCATCCATATCGCCTTCAAAT 3027 



Ill MM II II 

TATTGCCTCCAGAT 3007 



Db 


2948 


TCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCAGAT 


3007 


Qy 


3028 


GTCTCTGCTTTGGAACCTCAGACAGAAATGGGCAGCATAGTTAAATCCAAATCACTTACG 


3087 


Db 


3008 


M 1 1 1 1 1 1 1 1 1 1 MM MM II 1 1 1 1 1 1 1 1 h 1 Mill III 1 

GTTTCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTTGTG 


3067 


Qy 


3088 


AAAGAAG CAGAGAAAAAACTTC CTTCTGACACAGAGAAAGAGGACAGAT CCCTGT CAGCT 

MINIM lllllllllllllllll II Mill MIMIMMMM 1 II III 

AAAGAAG CTGAGAAAAAACTTCCTT C CGATACAGAAAAAGAGGACAGATCACCAT CTGCT 


3147 


Db 


3068 


3127 


Qy 


3148 


GTATTGTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATT 

MM MMMMMMMMMMMMIMMMMIMM MINI MUM! 

ATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGACATT 


3207 


Db 


3128 


3187 


Qy 


ENGTH2 0 


GACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTC 

MMM MMIMM Ml II III MUM II 1 1 MM Ml 

AAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTATTC 


3267 


Db 


3188 


3247 


Qy 


3268 


AGCATTGTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTT 

Illlllll II Mill llllllllllllllllllllllllll Mill lllllllll 

AGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGCTTT 


3327 


Db 


3248 


3307 


Qy 


3328 


AGGATATATAAGGG CGTGATC CAGG CTATC CAGAAATCAGATGAAGG CCACCCATTCAGG 

II MM II Mill Illlllll MMMMMMMMIMMMMMMMIMM 

AGGATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGG 


3387 


Db 


3308 


3367 


Qy 


3388 


GCATATTTAGAATCTGAAGTTGCTATATCAGAGGAATTGGTTCAGAAATACAGTAATTCT 

llllll 1 MMMIMMMMIMM Mill Illllllllll MIMMMMI 

GCATATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACA | | | ATTCT 


3447 


Db 


3368 


3427 


Qy 


3448 


GCTCTTGGTCATGTGAACAGCACAATAAAAGAACTGAGGCGGCTTTTCTTAGTTGATGAT 


3507 


Db 


3428 


MMMIMMMIMM MM Mill Mill Mill M lllllllllllllll 

GCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGATGAT 


3487 


Qy 


3508 


TTAGTTGATTCCCTGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATGTTGGTGCCTTG 

Illllllllll MMMMMMIMMMMIMM Mill IIIIIIMIIIIIII 

TTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTG 


3567 


Db 


3488 


3547 


Qy 


3568 


TTCAATGGTCTGACACTACTGATTTTAGCTCTGATCTCACTCTTCAGTATTCCTGTTATT 

M M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II II II 1 II II II 1 II II 1 1 1 1 1 1 1 1 1 
TTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTTATT 


3627 


Db 


3548 


3607 


Qy 


3628 


TATGAACGG CAT CAGGTG CAGATAGATCATTATCTAGGACTTG CAAACAAGAGTGTTAAG 

MMMMIMIMM IMIIIIIIMIIIIIIIIIIIIIIIIII MM llllll 

TATGAACGG CATCAGGCACAGAT3 Y 50XC CTAGGACTTG CAAATAAGAAtCGTTfififi 


3687 


Db 


3608 


2667 


Qy 


3688 


GATGCCATGGCCAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCAGA 374 0 

Mill Mill MMMMMMIMMMMMIMIMMMIMM II 

GATGCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGA 372 0 




Db 


3668 





RESULT 8 
AAZ56886' 

ID AAZ56886 standard; DNA; 3579 BP. 
XX 

AC AAZ56886; 



XX 

DT 25-APR-2000 (first entry) 
XX 

DE Human MAGI polypeptide encoding DNA . 
XX 

KW MAGI protein; neuroendocrine- specif ic protein; neuropathy; human; 

KW spinal injury; neuronal degeneration; neuromuscular disorder; cancer; 

KW psychiatric disorder; developmental disorder; inflammatory disorder; 

KW stroke; cytostatic; cerebroprotective ; neuroprotective; ds . 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 1 . .3579 

FT /*tag= a 

FT /product= "MAGI polypeptide" 

XX 

PN WO200005364-A1 . 
XX 

PD 03-FEB-2000. 
XX 

PF 21-JUL-1999; 9 9WO-GB023 6 0 . 
XX 

PR 22-JUL-1998; 98GB-0016024 . 

PR 19-JUL-1999; 99GB-00168 98 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 
XX 

PI Michalovich D, Prinjha RK; 
XX 

DS WPI; 2000-182603/16. 

DR P-PSDB; AAY569 

XX 

PT Novel polypeptides related to neuroendocrine-specif ic proteins and 

PT polynucleotides useful for diagnosis of various diseases and for 

PT treatment of cancer and neurological disorders - 
XX 

PS Claim 5; Page 19-20; 35pp; English. 
XX 

CC The invention relates to human MAGI protein, which is similar to 

CC neuroendocrine-specif ic protein. The MAGI protein can be expressed by 

CC standard recombinant methodology. The MAGI polypeptides, polynucleotides 

CC and antibodies are useful for treating diseases, including neuropathies, 

CC spinal injury, neuronal degeneration, neuromuscular disorders, 

CC psychiatric disorders and developmental disorders, cancer, stroke and 

CC inflammatory disorders. The polynucleoitde is also useful for chromosome 

CC localization and for tissue expression studies. The present sequence 

CC represents a DNA encoding the human MAGI protein. 

XX 

SQ Sequence 3579 BP; 1074 A; 803 C; 812 G; 890 T; 0 other; 

Query Match 61.2%; Score 2289.2; DB 21; Length 3579; 

Best Local Similarity 81.5%; Pred. No. 0; 

Matches 2925; Conservative 0; Mismatches 548; Indels 117; Gaps 19; 
Qy 253 ATGGAAGACATAGACCAGTCGTCGCTGGTCTCCTCGTCCACGGACAGCCCGCCCCGGCCT 312 

MINIMI I 1 1 1 1 ! 1 1 1 I Mill 1 1 1 ! 1 1 1 I r 1 1 1 1 J 1 1 r 1 1 1 1 1 1 1 1 



Db 



1 ATGGAAGACCTGGACCAGTCTCCTCTGGT CTCGTCCTCGGACAGCCCACCCCGGCCG 57 



Qy 313 CCG CCCG CCTTCAAGTACCAGTTCGTGACGGAG C C CGAGGACGAGGAGGACGAGGAGGAG 372 

I Mllll IMIM Mill! Ill IIMIIIIIIIMIIIMI II II III 

Db 58 CAG CCCG CGTTCAAGTACCAGTTCGTGAGGGAG C C CGAGGACGAGGAG GAAGAAGAG 114 

Qy 3 73 GAGGAGGACGAGGAGGAGGACGACGAGGACCTAGAGGAACTGGAGGTGCTGGAGAGGAAG 432 

Mill IMIM II Mill Mill Mill IIIIIIMI MIMI I 

Db 115 GAGGAGGAAGAGGAGGACGAGGACGAAGACCTGGAGGAGCTGGAGGTGCTGGAGAGGAAG 174 



Qy 433 CC CG CAG C CGGG CTGTC CGCAG CTG CGGTGC CGCCCGCCGCCGCCGCGCCGCTG 486 

IIIII MM M| |M II I MM I II IMMM Mllll III 

Db 175 CCCGCCGCCGGGCTGTCCGCGGCCCCAGTGCCCACCGCCCCTGCCGCCGGCGCGCCCCTG 234 

Qy 4 87 CTGGACTTCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGCCGGCCGCGCCC 54 6 

III IIIII I I MM IIIII II MUM II II III 1 1 MII#o|| | 

Db 235 ATGGACTTCGGAAATGACTTCGTGCCGCCGGCGCCCCGGGGACCCCTGCCGGCCGCTCCC 2 94 
Qy 54 7 CCTGCCGCTCCTGAGAGGCAGCCATCCTGGGAACGCAGCCCCG CGGCGCCC 597 

II I Ml II III IMMM M Mill I Mill I I I MM 

Db 2 95 CCCGTCGCCCCGGAGCGGCAGCCGTCTTGGGACCCGAGCCCGGTGTCGTCGACCGTGCCC 354 

Qy 5 98 GCGCCATCCCTGCCGCCCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAGAGGACGACGAG 657 

Mllll II I I MINIMI MUM II III I Mllll I 

Db 3 55 GCGCCATCCCCGCTGTCTGCTGCCGCAGTCTCGCCCTCCAAGCTCCCTGAGGACGACGAG 414 
Qy 658 CCTCCGGCGAGGCCCCCGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGGCGGAG 711 

IMMM MM II Mill II II III III IIIIIMI III III 

Db 415 CCTCCGGCCCGGCCTCCCCCTCCTCCCCCGGCCAGCGTGAGCCCCCAGGCAGAGCCCGTG 474 



Qy 712 CCCGCCGCGCCCCCTTCCACGCCGGCA CAAGCGC 1660 

Mill Mllll Mill MMMIMMMIMM 

Db 4 75 TGGACCCCGCCAGCCCCGGCTCCCGCCGCGCCCCCCTCCACCCCGGCCGCGCCCAAGCGC 534 
Qy 751 AGGGGCTCC GGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCT 807 

IIIIIIMI Ml MIMMMMIMM IMIM IMMIMI Mllll I 

Db 535 AGGGGCTCCTCGGGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCT 594 

Qy > 8 08 GTGATACCCTCCTCTGCAGAAAAAATTATGGATTTGATGGAGCAGCCAGGTAACACTGTT 867 

IMMM IMIM I III Mllll MM I II II II 1 1 1 II II 1 1 II I II 

Db 5 95 GTGATACGCTCCTCTGCAGAAAA - - - TATGGACTTGAAGGAGCAGCCAGGTAACACTATT 651 

Qy 868 TCGTCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCCTCTCTTCCTTCT 927 

III Mill IIIII MMIMII IIIIIMIMI llllllllll I! IMMIIIIMI 

Db 652 TCGGCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCTTCTCTTCCTTCT 711 

Qy00X9 92 8 CTATCTCCTCTCTCAACTGTTT CATGGATACCTTGGTAACTTAGfiEffi(C® 9820 

II IMMIIIIMI I I Mllll IIIIIMIM I II Mill II Ml II 

Db 712 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 771 

Qy 988 GTGTCATCCTt^GAAGGAACAATTGAAGAAACTTTAAATGAAGCTTCTAAAGAGTTGCCA 1047 

II I I II I IIIIIIMI II Mllll I I I I I h I I I II 

Db 772 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 831 

Qy 1048 GAGAGGGCAACAAATCCATTTGTAAATAGAGATTTAGCAGAATTTTCAGAATTAGAATAT 1107 

MM IIIII II II I I II MIMIIMM MM Mi 1 1 II I II 1 1 1 1 II 

Db 832 GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 891 



Qy 1108 TCAGAAATGGGATCATCTTTTAAAGG CT C CCCAAAAGGAGAGTCAG CCATATTAGTAGAA 1167 

lllllllllllllllll II I I III lllllll III II III II llllll I 
Db 8 92 TCAGAAATGGGATCATCGTT CAGTGT CT CTC CAAAAGCAGAAT CTG CCGTAATAGTAGCA 951 

Qy 1168 AACACTAAGGAAGAAGTAATTGTGAGGAGTAAA- - - GACAAAGAGGATTTAGTTTGTAGT 1224 

II III lllllll I I II I I I I I I I I I II Mill I MINI III I 
Db 952 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1011 

Qy 1225 GCAGCCCTTCACAGTCCACAAGAATCACCT GTGGGTAAAGAAGAC 1269 

llllll I II llllll I MM Ml llllll II 

Db 1012 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1071 

Qy 127 0 AGAGTTGTGT CT CCAGAAAAGACAATGGACATTTTTAATGAAATG CAGATGTCAGTAGTA 132 9 

lllllll lllllll Ml MM MINI II I I I MM I I 

Db GAAGTTGTGTCTTCAGAAAAAGQiAAAGACAGTTTTAATGAAAAGAGAGTTGCAGCFGGAA 228 01 

Qy 133 0 G CAC CTGTGAGGGAAGAGTATG CAGACTTTAAGCCATTTGAACAAG CATGGGAAGTGAAA 13 89 

II III Mill II IIIM II II II II 1 1 1 1 I II llllll MM 

Db 1132 G CT C CTATGAGGGAGGAATATG CAGACTTCAAACCATTTGAG CGAGTATGGGAAGTGAAA 1191 
Qy 13 90 GATACTTATGAGGGAAGTAGGGATGTG CTGG CTG CTAGAGCT AATGTG 1437 

MM I III I Ml III II M MM Ml I II II 

Db 1192 GATA GTAAGGAAGATAGTGATATGTTGGCTGCTGGAGGTAAAATCGAGAGCAACTTG 124 8 

Qy 1438 GAAAGTAAAGTGGACAGAAAATGCTTGGAAGATAG C CTGGAG CAAAAAAGT CTTGGGAAG 14 97 

1 1 1 1 1 1 1 I llllll II I MIIIIMI lllllll I II I II 

Db 124 9 GAAAGTAAAGTGGATAAAAAATGTTTTG CAGATAG CCTTGAG CAAACTAAT CACGAAAAA 13 08 

Qy 14 98 GATAGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAACCTGTGAAGGAC 1557 

MM III I I M I II 14 III II 1 1 II I II I II MM I 'Ml 2460 

Db 13 09 GATAGTGAGAGTAGTAATGATGATACTTCTTTCCC CAGTACG CCAGAAGGTATAAAGGAT 13 68 

Qy 1558 AGCTCCAGAGCATATATTACCTGTGCTTCCTTTACCT - - - CAGCAACCGAAAGCACCACA 1614 

I I Mill I II MMM llllll I IMMM II MM II 

Db 1369 CGTCCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGA 1428 

Qy 1615 GCAAACACTTTCCCTTTGTTAGAAGATCATACTTCAGAAAATAAAAC^ 1674 

llllll III MMMMM Mill I MMM M M M MMMIIMM 

Db 142 9 ACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAAAAA 148 8 
Qy 1675 ATAGAAGAAAGGAAGGCCCAAATTATAACAGAGAAG ACTAGCCCCAAAACGTCAAAT 1731 

IMIMIMI MMMMM Mill MM llllll IMMM IIIM 

Db 148 9 ATAGAAGAAAAGAAGG C C CAAATAG TAA CAGAGAAGAATA C TAG CAC CAAAA CAT CAAA C 154 8 

Qy 1732 CCTTTCCTTGTAGCAGTACAGGATTCTGAGGCAGATTATGTTACAACAGATACCTTATCA 17 91 

Ma MMM I MM II MM II MMMMM IMIMIMI III It 2940 

Db 154 9 CCTTTTCTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAACAGATAATTTAACA 1608 

Qy 17 92 AAGGTGACTGAGGCAGCAGTGTCAAACATGCCTGAAGGTCTGACGCCAGATTTAGTTCAG 1851 

MMMMM III M III MMMMM III Mill lllllll MM III 

Db 160 9 AAGGTGACTGAGGAAGTCGTGG CAAACATGC CTGAAGG C CTGACT CCAGATTTAGTACAG 1668 

Qy 1852 GAAG CATGTGAAAGTGAACTGAATGAAG C CACAGGTACAAAGATTG CTTATGAAACAAAA 1911 

IIMIMIIIIIIIIMI MIIIIMI Mill II Mill- MMM Ml 

Db 1669 GAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAACAAAA 1728 



Qy 1912 GTGGACTTGGTCCAAACATCAGAAGCTATACAAGAATCACTTT^ 1971 

lillll lllill III III Mill Ml II II IMIMIIIII 

Db 172 9 ATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGT 178 8 

Qy 1972 TGCCCATCATTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATG 2 031 

llllllllllllll II I Mill IMM IIIMIIIIIIIIilll IMIMIII 

Db 1789 TGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTTATG 1848 

Qy 2 032 GAAGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTA 2 091 

MINI III: llllll MM II MINI Ml II I II I II I M I 

Db 184 9 GAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGCTCA 1908 

Qy 2 0 92 TCCCCACTGGAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCTTGAGCCTGAAAAC 2151 

II III I MM II II llllll MUM 1 1 Mill I lllllllllllll 

Db 1909 TC00XA AAG CTT CTTCAGTTAATTATGAAAGCATAAAACATGAG C CTGAAAAG 3965 

Qy 2152 CC C CCAC CATATGAAGAAG C CATGAATGTAG CACT AAAAGCTTTGGGAACAAAGGAA 22 08 

IIIIIMIIIIIIIIM Mill MM MM Mill I MM MM 

Db 1966 CCCCCACCATATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATCAGGAATAAAGGAA 2 025 

Qy 22 09 GGAATAAAAGAGCCTGAAAGTTTTAATG CAG CTGTT CAGGAAACAGAAGCTC CTTATATA 2268 

I III IMM MM I MM M I MM II 1 1 II 1 1 M II II II II I II 

Db 2 026 GAAATTAAAGAGCCTGAAAATATTAATG CAG CTCTT CAAGAAACAGAAGCTCCTTATATA 2 08 5 

Qy 22 69 TCCATTGCGTGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGCCAAGTCCAGATTTC 232 8 

II Mill lillllllllllllllllllllllll II MM III III llllll 

Db 2 086 TCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGATTTC 214 5 

Qy 2329 TCTAATTATTCAGAAATAGCAAAATTCGAGAAGTCGGTGCCCGAACACGCTGAGCTAGTG 238 8 

III I Mill IM llllll III II I Mill II 

Db 2146 TCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGCTAGTT 22 05 

Qy 23 89 GAGGATTCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAA 244 8 

II MIMMMMIM I II , I I I I II I! i N II Mill 

Db 22 06 GAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCTGAC 2265 

50XA A| |$^A 3480 

Qy 244 9 GTCCCACAAACACAAGAGGAGGCTGTGATGCTCATGAAGGAGAGTCTCACTGA A 2502 

II MINI llllll II IIMMIMI MM II 1 1 1 ; 1 1 1 1 1 1 1 I 

Db 2266 GTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACTTCA 2325 

Qy 2503 GTGTCTGAGACAGTAGCCCAGCAGAAAGAGGAGAGACTTAGTGCCTCACCTCAGGAGCTA 2562 

I I III I I II MM I III Mill I II I III I 

Db 2326 TTTGAGT CAATGATAGAATATGAAAATAAGGAAAAACTCAGTG CTTTG CCACCTGAGGGA 2385 
Qy 2 563 GGAAAGCCATATTTAGAGTCTTTTCAGCCCAATTTACATAGTACAAAAGATGC TGCA 2619 

MINIMI 

Db 2386 GGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAAAGATACCCTGTTA 2445 

Qy 2620 TCTAATGACATTCCAACATTGACCAAAAAGGAGAAAATTTCTTTGCAAATGGAAGAGTTT 2679 

II MM II IIIIIMM llllll INININI lllllll Mill III I 

Db 24 46 CCTGATGAAGTTT CAACATTGAG CAAAAAGGAGAAAATTCCTTTG CAGATGGAGGAG CTC 25 05 

Qy 268 0 AATACTGCAATTTATTCAAATGATGACTTACTTTCTTCTAAGGAAGACAAAATAAAAGAA 273 9 

I lllllll MMMMMIMMIMM II MM Mill I MM MM 

Db 2506 AGTACTGCAGTTf4TT<PM4^^ 2565 

Qy 274 0 AGTGAAACATTTTCAGATTCATCTCCGATTGAGATAATAGATGAATTTCCCACGTTTGTC 27 99 



I IN I 1 1 1 1 1 1 i 1 1 1 1 1 - 1 i I 1 1 1 1 1 II IIMIIII 1 1 II II II II 

Db 2566 ACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACATTGATC 2625 
Qy 28 00 AGTGCTAAAGATGATTC TCCTAAATTAGCCAAGGAGTACACTGATCTAGAAGTATCC 28 56 

III Mill MM I 1 1 ( 1 1 1 1 1 ! 1 1 1 III II Mill MMIMIMM 

Db 2 626 AGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAAGTATCC 2685 

Qy 2 857 GACAAAAGTGAAATTG CTAATATCCAAAGCGGGG CAGATT CATTG CCTTG CTTAGAATTG 2 916 

MMMMMMIMIMM II II II I MIMMIMM IIIMM 

Db 2 686 CACAAAAGTGAAATTG CTAATG CCC CGGATGGAG CTGGGT CATTG CCTTG CACAGAATTG 274 5 

Qy 2 917 CCCTGTGACCTTTCTTTCAAGAATATATATCCTAAAGATGAAG TACATGTTTCA 2970 

III MIMMMIM Mill Ml I 1 1 MM Mill I I I III 

Db 274 6 CCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGTTTCTCA 28 05 

Qy DN1 GATGAATTCTCCGAAAATAGGTCCAGTGTATCTAAGGC^TCCATATCGCCTTC^ATGTC 303 0 

Mill II II Mill MM I III MM I III MM II MM 

Db 28 06 GATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCAGATGTT 28 65 

Qy 3 031 TCTGCTTTGGAACCTCA.GACAGAAATGGGCAGCATAGTTAAATCCAAATCACTTACGAAA 30 90 

IIIIMIMI MM MM II I MIMMMIM Mill III MM 

Db 2866 TCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTTGTGAAA 2925 

Qy 3 091 GAAG CAGAGAAAAAACTTCCTT CTGACACAGAGAAAGAGGACAGATCCCTGT CAGCTGTA 3150 

Mill IMIIIIIIMIIIIII II Mill 1 1 i I J I E 1 1 1 1 1 1 1 I II III II 

Db 2 92 6 GAAG CTGAGAAAAAACTTCCTT C CGATACAGAAAAAGAGGACAGATCAC CAT CTGCTATA 2985 

Qy 3151 TTGTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAG 3210 

II MMMMMIMMMMMMMMMMMIMM MMMMMIMIMM 

Db 2 98 6 TTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCEffiG2JaCTG$A AAG 3045 

Qy 3211 AAGACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGC 32 7 0 

IIIIIIIIIIIIIIIIIIIIIIMIII illllllllllll II IIIMM llllll 

Db 3 046 AAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTATTCAGC 3105 

Qy 3271 ATTGTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGG 333 0 

Mill II Mill MMMMMMMMIMMMIM Mill MIMMMIM 

Db 3106 ATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGCTTTAGG 3165 

Qy 3331 ATATATAAGGGCGTGATCCAGGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGGCA 33 9 0 

Mill Mill IIMIIII IIIIIMIIIIIMIIIIIIIIIIIIIMIMMIIIII 

Db 3166 ATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGGa^ 322 5 

Qy 33 91 TATTTAGAATCTGAAGTTGCTATATCAGAGGAATTGGTTCAGAAATACAGTAATTCTGCT 34 50 

Ml I IIMMMIMMMIMM Mill MIMMIIM MMMMMIMM 

Db 3226 TATCTGGAATCTGAAGTTGCT*^AS'CZ5 , SSGGAGTTGGTTCAGAAGTACAGTAATTCTGCT 32 85 

Qy 34 51 CTTGGTCATGTGAACAGCACAATAAAAGAACTGAGGCGGCTTTTCTTAGTTGATGATTTA 3510 

111,11 1 1 1 1 Mill Mill Mill II M I I ! I I 

Db 3286 CTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGATGATTTA 334 5 

Qy 3 511 GTTGATTCCCTGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTC 3570 

IIMIIII MMMMMMMMMMIIMM Mill MMMIMIMIMM 

Db 3346 GTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTT 34 05 

Qy 3 571 AATGGTCTGACACTACTGATTTTAGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTAT 363 0 

I I I M 1 1 ! 1 1 1 1 1 1 1 1 ! 1 1 1 1 ! I Mill || II 1 1 1 II 1 1 1 II MIMMMIMM 



Db 


3406 


AATGGTCTGACACTACTGATTTTGGCTCTCATTTGA.CTCTTCAGTGTTCCTGTTATTTAT 


3465 


Qy 


3631 


GAACGGCATCAGGTGCAGATAGATCATTATCTAGGACTTGCAAACAAGAGTGTTAAGGAT 

MINIM; II IIMM MIIIMM MIMMM INI Ml III 

GAACGG CATCAGG CG CAGATAGATCATTAT CTAGGACTTGCAAATAAGAATGTTAAAGAT 


3690 


Db 


3466 


3525 


Qy 


3691 


GCCATGGCCAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCAGA 374 0 

II Illll MIMM MMIMIMIMM IMIIIM! 1 M 

GCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGA 3575 




Db 


3526 





RESULT 9 
AAF90324 

ID AAF90324 standard; cDNA; 3579 BP, 
XX 

AC AAF90324; 
XX 

DT 23-JUL-2001 (first entry) 
XX 

DE Human NOGO-A cDNA . 
XX 

KW NOGO-A; human; chromosome 2p21; neuropathy; spinal injury ; 

KW brain injury; stroke; neuronal degeneration; Alzheimer's disease; 

KW Parkinson's disease; neuromuscular disorder; psychia ric disorder; T 

KW developmental disorder; neuroprotective; nootropic; neuroleptic; 

KW antiparkinsonian; cerebroprotective; neuroleptic; diagnosis ; 

KW therapy; ss. 

XX 

OS Homo sapiens. 
XX 

PN WO200136631-A1. 
XX 

PD 25-MAY-2001. 
XX 

PF 14-NOV-2000; 2 000WO-GB04345.. 
XX 

PR 15-NOV-1999; 99GB-0026995 . 

PR 24-JAN-2000; 2 00 0GB- 000155 0 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM PLC . 
XX 

PI Michalovich D, Prinjha R; 
XX 

DR WPI; 2001-343822/36. 

DR P-PSDB; AAB82349. 
XX 

PT New polypeptide designated NOGO-C is a splice variant of the human NOGO 

PT gene and may be useful in the treatment of neural disorders including 

PT Alzheimer's and Parkinson's diseases 
XX 

PS Disclosure; Page 25-26; 25pp; English. 
XX 

CC The present sequence is that of cDNA encoding human NOGO-A (see 

CC AAB82349) . NOGO-A is a previously known splice variant G 1167 

CC human NOGO gene on chromosome 2p21. NOGO-A cDNA was obtained by 

CC PCR amplification of human spinal cord cDNA . The invention 

CC relates to a novel splice variant, NOGO-C (see AAF90323) . It 



CC provides NOGO-C polypeptides and polynucleotides, and methods for 

CC producing such polypeptides by recombinant techniques. Also 

CC disclosed are methods for utilising NOGO-C polypeptides and 

CC polynucleotides in the treatment of diseases including neuropathies, 

CC spinal injury, brain injury, stroke, neuronal degeneration, for 

CC example Alzheimer's disease and Parkinson's disease, neuromuscular 

CC disorders, psychiatric disorders and developmental disorders. Also 

CC provided are methods for identifying agonists and agonists for 

CC use in treating conditions associated with NOGO-C imbalance, and 

CC diagnostic assays for detecting diseases associated with 

CC inappropriate NOGO-C activity or levels. 

XX 

SQ Sequence 3579 BP; 1074 A; 803 C; 812 G; 890 T; 0 other; 

Query Match 61.2%; Score 2289.2; DB 22; Length 3579; 

Similarity 81.5%; Pred. K&CAG£GA7 -75XT 
Matches 2925; Conservative 0; Mismatches 548; Indels 117; Gaps 19; 

Qy 2 53 ATGGAAGACATAGACCAGTCGTCGCTGGTCTCCTCGTCCACGGACAGCCCGCCCCGGCCT 312 

1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 I Mill 1 1 II I II MINIM II II I II I 

Db 1 ATGGAAGACCTGGACCAGTCTCCTCTGGT CTCGTCCTCGGACAGCCCACCCCGGCCG 57 

Qy 313 CCG CCCG CCTT CAAGTAC CAGTTCGTGACGGAG C C CGAGGACGAGGAGGACGAGGAGGAG 372 

I llllll IIMI III II II 1 1 II I II IIIIIIMIMMIIIIII II II III 

Db 58 CAGCCCGCGTTCAAGTACCAGTTCGTGAGGGAGCCCGAGGACGAGGAG GAAGAAGAG 114 

Qy 3 73 GAGGAGGACGAGGAGGAGGACGACGAGGACCTAGAGGAACTGGAGGTGCTGGAGAGGAAG 432 

MM: III M 1 1 Mill IIMI IIMI MIMIMMIMMMIMI 

Db 115 GAGGAGGAAGAGGAGGACGAGGACGAAGACCTGGAGGAGCTGGAGGTGCTGGAGAGGAAG 174 

Qy 433 C C CG CAG C CGGG CTGT C CG CAG C TG CGG TG C CGCCCGCCGCCGCCGCGCCGCTG 486 

Db 175 CCCGCCGCCGGGCTGTCCGCGGCCCCAGTGCCCACCGCCCCTGCCGCCGGCGCGCCCCTG 234 

Qy 4 87 CTGGACTTCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGCCGGCCGCGCCC 546 

Mill I I MM Mill II MM II II ill: I III 

Db 235 ATGGACTTCGGAAATGACTTCGTGCCGCCGGCGCCCCGGGGACCCCTGCCGGCCGCTCCC 2 94 
Qy 547 CCTGCCGCTCCTGAGAGGCAGCCATCCTGGGAACGCAGCCCCG CGGCGCCC 5 97 

II I II! II III Illllll II Mill I IIMI I I I MM 

Db 2 95 CCCGTCGCCCCGGAGCGGCAGCCGTCTTGGGACCCGAGCCCGGTGTCGTCGACCGTGCCC 354 

Qy 5 98 GCGCCATCCCTGCCGCCCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAGAGGACGACGAG 657 

III II I I llllllllllll I I N i Nihil 

Db 3 55 GCGCCATCCCCGCTGTCTGCTGCCGCAGTCTCGCCCTCCAAGCTCCCTGAGGACGACGAG 414 
Qy 658 CCTCCGGCGAGGCCCCCGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGGCGGAG 711 

M I II M I MM II Mill II II III III III Mill III III 

Db 415 W^C^^ 491XC 
Qy 712 CCCGCCGCGCCCCCTTCCACGCCGGCCGCGCCCAAGCGC 750 

Ml M IIIMM Mill IIMMIIIIIIIIIMI 

Db 4 75 TGGACCCCGCCAGCCCCGGCTCCCGCCGCGCCCCCCTCCACCCCGGCCGCGCCCAAGCGC 534 
Qy 751 AGGGGCTCC GGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCT 8 07 

MIMMM 1 1 1 1 1 I ;' I III I i , 1 1 1 1 1 1 . 1 1 1 1 1 1 ' 1 1 , 1 Mill' I i 1 1 1 1 I 

Db 53 5 AGGGGCTCCTCGGGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCT 5 94 



Qy 808 GTGATACCCTCCTCTGCAGAAAAAATTATGGATTTGATGGAGCAGCCAGGTAACACTGTT 867 

iiiiiii iiiiiiiiiiiiMi iiiiii i M i ! 1 1 ! i i i ill 1 1 ; 1 1 1 1 1 

Db 595 GTGATACGCTCCTCTGCAGAAAA TATGGACTTGAAGGAG CAG C CAGGTAACACTATT 651 

Qy 868 TCGTCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCCTCTCTTCCTTCT 92 7 

III III MMM MMM llllll IMIIMII MMM llllllllllll 

Db 652 TCGGCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCTTCTCTTCCTTCT 711 

Qy 928 CTATCTCCTCTCTCAACTGTTTCTTTTAAAGAACATGGATACCTTGGTAACTTATCAGCA 98 7 

II MM. Mill I I llllll MMMM llllllllllll II III II 

Db 712 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 771 

Qy 988 GTGTCATCCTCAGAAGGAACAATTGAAGAAACTTTAAATGAAGCTTCTAAAGAGTTGCCA 104 7 

II I Ml I MINIMI II llllll I I I II MMM MM I II 

Db 772 GTATTACGCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 831 

Qy 104 8 GAGAGGGCAACAAAT C CATTTGTAAATAGAGATTTAG CAGAATTTTCAGAATTAGAATAT 1107 

1 1 1 1 Mill II II I i II lllllllllll 1 1 1 1 IIIIIIIIIIIIIIMI 

Db 832 GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 891 

Qy 1108 TCAGAAATGGGATCAT CTTTTAAAGG CTCCC CAAAAGGAGAGT CAGCCATATTAGTAGAA 1167 

llllll MMM I II I I III Illllll III II Ml II MMM I 

Db 8 92 TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 951 

Qy 1168 AACACTAAGGAAGAAGTAATTGTGAGGAGTAAA GACAAAGAGGATTTAGTTTGTAGT 1224 

II III Illllll MM MM I MM II Mill I llllll Ml I 

Db 952 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1011 

Qy 1225 GCAGCCCTTCACAGTCCACAAGAATCACCT GTGGGTAAAGAAGAC 12 69 

llllll I II llllll I MM III MMM II 

Db 1012 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1071 

Qy 1270 AGAGTTGTGTCTCCAGAAAAGACAATGGACATTTTTAATGAAATGCAGATGTCAGTAGTA 132 9 

MIIIMIII Illllll III MM IMIMIMM I I MM I I 

Db 1072 GAAGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTGGAA 1131 

Qy 133 0 G CAC CTGTGAGGGAAGAGTATG CAGACTTTAAGCCATTTGAACAAG CATGGGAAGTGAAA 13 8 9 

II Ml Illllll II lllllllllll II MMMM I II MIMMIMIM 

Db 1132 GCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTGAAA 1191 

Qy 13 9 0 GATACTTATGAGGGAAGTAGGGATGTGCTGGCTGCTAGAGCT AATGTG 14 37 

MM I III I Ml III II llllllll III I M II 

Db 1192 GATA GTAAGGAAGATAGTGATATGTTGGCTGCTGGAGGTAAAATCGAGAGCAACTTG 124 8 

Qy 1438 GAAAGTAAAGTGGACAGAAAATG CTTGGAAGATAG CCTGGAG CAAAAAAGTCTTGGGAAG 14 97 

IMIIIIIIIIIII I llllll II I IIMI! I MMM llllll 

Db 124 9 GAAAGTAAAGTGGATAAAAAATGTTTTG CAGATAGC CTTGAG CAAACTAATCACGAAAAA 13 08 

Qy 14 98 GATAGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAACCTGTGAAGGAC 1557 

llllllll I II Mill III llllllllllllllll MMM I I Mill 

Db 13 09 GATAGTGAGAGTAGTAATGATGATACTTCTTTCCCCAGTACGCCAGAAGGTATAAAGGAT 13 68 

Qy 1558 AGCTCCAGAGCATATATTACCTGTGCTTCCTTTACCT CAGCAACCGAAAGCACCACA 1614 

I I MIIIMIII II MMM llllll I Illllll II MM M 

Db 1369 CGTCCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATTGCA 142 8 



Qy 1615 GCAAACACTTTCCCTTTGTTAGAAGATCATACTTCAGAAAATAAAACAGATGAAAAAAAA 1674 

MINI III llllllllll Mill III MINI II II MUM II; 

Db 142 9 ACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAAAAA 14 88 
Qy 1675 ATAGAAGAAAGGAAGG C C CAAATTATAACAGAGAAG ACTAGCCCCAAAACGTCAAAT 1731 

IMIIIIIII IMIIIIIIIII 1 1 1 M 1 1 M 1 1 llllll lllllll Mill 

Db 14 8 9 ATAGAAGAAAAGAAGG C C CAAATAG TAA CAGAGAAGAATA CTAG CA C CAAAACAT CAAAC 1548 

Qy 1732 CCTTTCCTTGTAGCAGTACAGGATTCTGAGGCAGATTATGTTACAACAGATACCTTATCA 17 91 

Mill MIMIMII IMIIIIIII - 1 1 1 1 1 1 Mill! II Ml II 

Db 154 9 CCTTTTCTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAACAGATAATTTAACA 1608 

Qy 17 92 AAGGTGACTGAGGCAGCAGTGTCAAACATGCCTGAAGGTCTGACGCCAGATTTAGTTCAG 1851 

MIMIMIMM II Ml IMIMMMMMII Mill MIMMIMI III 

Db 160 

Qy 1852 GAAG CATGTGAAAGTGAACTGAATGAAG CCACAGGTACAAAGATTG CTTATGAAACAAAA 1911 

1 1 1 1 1 1 1 1 1 1 1 : 1 : 1 1 1 1 I II 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 E M 1 1 1 

Db 166 9 GAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAACAAAA 172 8 

Qy 1912 GTGGACTTGGTCCAAAC^TCAGAAGCTATACAAGAATCACTTTACCCCAC^GCACAGCTT 1971 

llllllllll MIMMMMM III Mill MM! M II MIMMIMI 

Db 172 9 ATGGACTTGGTTC^U^(^TC^GAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAGCTT 178 8 

Qy 1972 TGCCCATCATTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATG 2 031 

MM II MM II I Mill Mill MMMMMMIMM 1 1 1 1 1 1 1 1 1 

Db 178 9 TGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTTATG 1848 

Qy 2 032 GAAGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTA 2091 

MIMMIMI llllll MM II MIMMMII II I llllllll I 

Db 184 9 GAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGCTCA 19 08 

Qy 2 092 TCCCCACTGGAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCTTGAGCCTGAAAAC 2151 

II Ml I MM II II llllll llllll II Mill I MIMIMII 

Db 1909 T CAC CATTAGAAG CTTCTTCAGTTAATTATGAAAGCATAAAACATGAGCCTGAAAAC 1965 

Qy 2152 CC CC CACCATATGAAGAAG C CATGAATGTAGCACT AAAAGCTTTGGGAACAAAGGAA 22 08 

lllllllllllllllll lllllll 1 1 1 1 1 1 1 1 Mill I 1 1 1 1 lllllll 

Db 1966 CC CCCACCATATGAAGAGG C CATGAGTGTATCACTAAAAAAAGTATCAGGAATAAAGGAA 2 02 5 

Qy 2209 GGAATAAAAGAGCCTGAAAGTTTTAATGCAGCTGTTCAGGAAACAGAAGCTCCTTATATA 2268 

I III lllllllllllll I IMIIIIIII 1 1 1 1 MINIMI III Mill II: 

Db 2 026 GAAATTAAAGAG CCTGAAAATATTAATGCAGCT CTTCAAGAAACAGAAG CTC CTTATATA 2085 

Qy 2269 TCCATTGCGTGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGCCAAGTCCAGATTTC 2328 

II Mill II lllllllllllllllll llllll II MM III III llllll 

Db 2 086 TCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGATTTC 2145 

Qy 232 9 TCTAATTATTC^GAAATAGC^AAATTCGAGAAGTCGGTGCCCGAACACGCTGAGCTAGTG 2388 

Ml IMMMIMIM llllll III Ml Mill II M IMIIIIIII 

Db 214 6 TCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGCTAGTT 22 05 

Qy 2389 GAGGATTCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAA 2448 

II Mill IMMMMMMMMMMIMMMMM II Mill 

Db 22 06 GAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCTGAC 22 65 



244 9 ^GCTffifiKrcfff m GAAGgGGffi^nra^ 2668 



II IIIIIII MINI II IIIIIIIIII 1 1 1 1 II IIIIIIIIIII I 

Db 2266 GTTCCACAAAAACAAGATGAA 2325 

Qy 25 03 GTGTCTGAGACAGTAG CCCAG CACAAAGAGGAGAGACTTAGTG CCTCAC CTCAGGAG CTA 2562 

I I III I I II 1 1 1 1 I Ml Mill I II I Ml I 

Db 2326 TTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACT CAGTG CTTTGC CACCTGAGGGA 2385 
Qy 2563 GGAAAGCCATATTTAGAGTCTTTTCAGCCCAATTTACATAGTACAAAAGATGC TGCA 2619 

MIMIMMIMI II MM Ml II MM III lllllllll I I 

Db 2386 GGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAAAGATACCCTGTTA 2445 

Qy 2 620 TCTAATGACATT CCAACATTGAC CAAAAAGGAGAAAATTT CTTTGCAAATGGAAGAGTTT 2679 

II MM II lllllllll MMMMMM IIIIIII Mill Ml I 

Db 244 6 CCTGATGAAGTTTCAACATTGAG CAAAAAGGAGAAAATTC CTTTGCAGATGGAGGAG CTC 2505 

Qy 268 0 AATACTGCAATTTATTCAAATGATGACTTACTTTCTTCTAAGGAAGACAAAATAAAAGAA 273 9 

I MIMM MMMIMIMIMIMM II IMIIMMM I MM MM 

Db 25 06 AGTACTG CAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAG CACAGATAAGAGAA 2565 

Qy 274 0 AGTGAAACATTTTCAGATTCATCTCCGATTGAGATAATAGATGAATTTCCCACGTTTGTC 2799 

I MM MMMMMMIMM Mill II MM II II II II II 

Db 2 566 ACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACATTGATC 2625 

Qy 2 8 00 AGTGCTAAAGATGATTC - - - TCCTAAATTAGCCAAGGAGTACACTGATCTAGAAGTATCC 2856 

III Mill MUM I MMMMMM Ml II Mill MMMMM 

Db 2626 AGTTCTAAAACTGATTCATTTTCTAAATTAG C CAGGGAATATACTGACCTAGAAGTAT CC 268 5 

Qy 2857 GACAAAAGTGAAATTGCTAATATCCAAAGCGGGGCAGATTCATTGCCTTGCTTAGAATTG 2916 

MMMMM MMM II II 1 1 I IIIIIIMIMI IIIIIII 

Db 2 686 C7VCAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACAGAATTG 2745 
Qy 2 917 C CCTGTGACCTTT CTTT CAAGAATATATATCCTAAAGATGAAG TACATGTTTCA 297 0 

III IIIIIIMIMI Mill Ml I II MM Mill I I I III 

Db 2746 CCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGTTTCTCA 28 05 

Qy 2 971 GATGAATTCTCCGAAAATAGGTCCAGTGTATCTAAGGCATCCATATCGCCTTCAAATGTC 303 0 

Mill II II Mill MM I III MM I Ml MM II MM 

Db 2806 GATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCAGATGTT 28 65 

Qy 3 031 TCTGCTTTGGAACCTC^GACAGAAATGGGCAGC^TAGTTAAATCCAAATCACTTACGAAA 3090 

MMMMM MM MM II I MMMMM Mill III MM 

Db 2866 TCTGCTTTGGCC7VCTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTTGTGAAA 2925 

Qy 3 091 GAAGCAGAGAAAAAACTTCCTTCTGACACAGAGAAAGAGGACAGATCCCTGTCAGCTGTA 3150 

Mill MMM MMM M II Mill MMMMMM I II III II 

Db 2 926 GAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCTGCTATA 2985 

Qy 3151 TTGTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAG 3210 

II MMMMMMMMMMMMIMMMMIMM MIIMMIMMIMM 

Db 2 986 TTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGACATTAAG 3045 

Qy 3211 AAGACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGC 3270 

I MMMMI MMM M.M MMMMM II IIIIIII MMM 

Db 3 04 6 AAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTATTCAGC 3105 

Qy 3271 ATTGTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGG 333 0 

Mill II Mill Mi MMM M Ml M MM Mill MMMMM 



Db 



3106 ATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGCTTTAGG 3165 



Qy 3331 ATATATAAGGG CGTGATC CAGG CTATCCAGAAATCAGATGAAGGC CACC CATTCAGGG CA 33 90 

Mill Mill Mill III llllll II llllll ILIMIII IMII 

Db 3166 ATATACAAGGGTGTGATC CAAG CTAT CCAGAAATCAGATGAAGGC CAC C CATTCAGGG CA 3225 

Qy 3391 TATTTAGAATCTGAAGTTGCTATATCAGAGGAATTGGTTCAGAAATACAGTAATTCTGCT 34 50 

III I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 Mill lllllllllll Mllli IIIMII 

Db 3226 TATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAATTCTGCT 32 85 

Qy 3451 CTTGGTCATGTGAACAGCACAATAAAAGAACTGAGGCGGCTTTTCTTAGTTGATGATTTA 3510 

IMMMMIIMM MM Mill Mill MM! II I II I II 1 1 II I II I II 1 1 

Db 3286 CTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGATGATTTA 334 5 

Qy 3511 GTTGATTCCCTGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTC 357 0 

IIIMII! MMMMMMMMIMMIMM Mill 

Db 334 6 GTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTT 34 05 

Qy 3571 AATGGTCTGACACTACTGATTTTAGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTAT 363 0 

II IMII llllll III Mill II 1 1 I M 1 1 1 1 1 1 llllll MMI 

Db 34 06 AATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTTATTTAT 3465 

Qy 3631 GAACGGCATCAGGTGCAGATAGATCATTATCTAGGACTTGCAAACAAGAGTGTTAAGGAT 3690 

I 1 1 I i 1 1 1 1 I III Mill: II llllll IMIMI MM llllll III 

Db 3466 GAACGG CAT CAGG CG CAGATAGAT CATTAT CTAGGACTTG CAAATAAGAATGTTAAAGAT 3525 

Qy 3691 GCCATGGCCAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCAGA 374 0 

II Mill IMIIMII II MINI llllll III IIIMII II 

Db 3526 G CTATGG CTAAAAT CCAAG CAAAAATCCCTGGATTGAAG CG CAAAG CTGA 3575 



RESULT 10 
ABN86601 

ID ABN86601 standard; DNA; 3579 BP. 
XX 

AC ABN86601; 
XX 

DT 05-NOV-2002 {first entry) 
XX 

DE Human neurotransmitter receptor protein Nogo encoding DNA. 
XX 

KW Nerve regeneration; neuroprotection; neuronal degeneration; CNS; PNS; 
KW central nervous system; peripheral nervous system; tranquillizer; Nogo; 
KW vulnerary; cerebroprotective; anti-tumour; antidiabetic; anticonvulsant; 
KW nootropic; antiparkinsonian; ophthalmological ; analgesic; hepatotropic ; 
KW osteopathic; vasotropic; nephrotropic ; cytostatic; antigen; gene therapy; 
KW neurotransmitter receptor; human; gene; ds . 
XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 

FT CDS 1..3579 

FT /*tag= a 

FT /product= "Nogo" 

FT /note= "Nogo-A, Nogo-B and Nogo -C 11 

XX 

PN US2002072493-A1. 



XX 

PD 13-JUN-2002. 
XX 

PF 28-JUN-2001; 2001US-0893348 . 
XX 

PR 19-MAY-1998; 98 IL- 0124500 . 

PR 21-JUL-1998; 98WO-US14 715 . 

PR 22-DEC-1998; 98US-0218277 . 

PR 19-MAY-1999; 99US-0314 161 . 
XX 

PA (YEDA ) YEDA RES & DEV CO LTD. 
XX 

PI Eisenbach- Schwartz M, Hauben E, Cohen IR, Beserman P, Mosonego A; 

PI Moalem G; 

XX 

DR WPI; 2002-607255/65. 

DR P-PSDB; ABB81078, ABB81079, ABB81080. 
XX 

PT Promoting nerve regeneration and preventing neuronal degeneration in 

PT the central/peripheral nervous system from injury/disease, comprises 

PT administering nervous system-specific activated T cells/antigen, or 

PT analogs/peptides 
XX 

PS Disclosure; Page 49-53; 93pp; English. 
XX 

CC The invention relates to promoting nerve regeneration or conferring 

CC neuroprotection and preventing or inhibiting neuronal degeneration in the 

CC central /peripheral nervous system (NS) . The method involves administering 

CC NS-specific activated T cells, NS-specific antigen, its analogue or its 

CC peptide, a nucleotide sequence the NS-specific antigen or its analogue or 

CC combinations. The method is useful for promoting nerve regeneration and 

CC preventing neuronal degeneration in central /peripheral nervous system 

CC from injury/disease, where the injury is spinal cord injury, blunt 

CC trauma, penetrating trauma, hemorrhagic stroke, ischaemic stroke or 

CC damages caused by surgery such as tumour excision. The disease is not an 

CC autoimmune disease or neoplasm. The disease results in a degenerative 

CC process occurring in either gray or white matter or both. The disease 

CC is diabetic neuropathy, senile dementia, Alzheimer's disease, Parkinson's 

CC disease, facial nerve (Bell's) palsy, glaucoma, Huntington's chorea, 

CC amyotrophic lateral sclerosis, non-arteritic optic neuropathy, and 

CC vitamin deficiency, intervertebral disc herniation, prion diseases such 

CC as Creutzf eldt-Jakob disease, carpal tunnel syndrome, peripheral 

CC neuropathies associated with various diseases, including but not limited 

CC to uremia, porphyria, hypoglycemia, Sjorgren Larsson syndrome, acute 

CC sensory neuropathy, chronic ataxic neuropathy, biliary cirrhosis, primary 

CC amyloidosis, obstructive lung diseases, acromegaly, malabsorption 

CC syndromes, polycythemia vera, immunoglobulin (Ig)A- and IgG gamma - 

CC pathies, complications of various drugs (e.g., metronidazole) and toxins 

CC (e.g., alcohol or organophosphates) , Charcot -Marie -Tooth disease, ataxia 

CC telangectasia , Friedreich's ataxia, amyloid polyneuropathies, 

CC adrenomyeloneuropathy, Giant axonal neuropathy, Ref sum's disease, Fabry's 

CC disease, or 1 ipoproteinemia . The present sequence represents a DNA 

CC encoding the human neurotransmitter receptor protein Nogo (Nogo-A, Nogo-B 

CC and Nogo-C) , an example of NS-specific antigen. 

XX 

SQ Sequence 3579 BP; 1074 A; 803 C; 812 G; 890 T; 0 other; 



Query Match 61.2%; Score 2289.2; DB 24; Length 3579; 

Best Local Similarity 81.5%; Pred. No. 0; 

Matches 2 925; Conservative 0; Mismatches 548; Indels 117; Gaps 19; 

253 ATGGAAGACATAGACCAGTCGTCGCTGGTCTCCTCGTCCACGGACAGCCCGCCCCGGCCT 312 

MINIMI I MINIM I INN Mill IIIIIIIIII ill III 

1 ATGGAAGACCTGGACCAGTCTCCTCTGGT CTCGTCCTCGGACAGCCCACCCCGGCCG 57 

313 CCGCCCGCCTTCAAGTACCAGTTCGTGACGGAGCCCGAGGACGAGGAGGACGAGGAGGAG 372 

I MINI ill IIIM Mill MIIMI llllllll II II Ml 

58 CAG CCCG CGTT CAAGTAC CAGTTCGTGAGGGAGCCCGAGGACGAGGAG GAAGAAGAG 114 

373 GAGGAGGACGAGGAGGAGGACGACGAGGAC CTAGAGGAACTGGAGGTG CTGGAGAGGAAG 432 

llllllll llllllll II Mill Mill Mill MIMM II III M I! 

115 GAGGAGGAAGAGGAGGACGAGGACGAAGACCTGGAGGAGCTGG GGTG CTGGAGAGGAAG 174 
433 CCCGCAGCCGGGCTGTCCGCAGCTGCGGTGC CGCCCGCCGCCGCCGCGCCGCTG 486 

Mill IMIMMMMM II I MM I II lllllll llllll Ml 

175 GCCGCCGCCGGGCTGTCCGCGGCCCCAGTGCCCACCGCCCCTGCCGCCGGCGCGCCCCTG 234 
487 CTGGACTTCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGCCGGCCGCGCCC 54 6 

llllllll I I MM Mill II IIIIIMI II II lllllllllll Ml 

23 5 ATGGACTTCGGAAATGACTTCGTGCCGCCGGCGCCCCGGGGACCCCTGCCGGCCGCTCCC 294 
54 7 CCTGCCGCTCCTGAGAGGCAGCCATCCTGGGAACGCAGCCCCG CGGCGCCC 5 97 

II I III II III lllllll II Mill I Mill I I I MM 

2 95 CCCGTCGCCCCGGAGCGGCAGCCGTCTTGGGACCCGAGCCCGGTGTCGTCGACCGTGCCC 354 
5 98 GCGCCATCCCTGCCGCCCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAGAGGACGACGAG 657 

MIMIIMI II I I MM MUM lllllll MIM IMMIMMM 

355 GCGCCATCCCCGCTGTCTGCTGCCGCAGTCTCGCCCTCCAAGCTCCCTGAGGACGACGAG 414 

658 CCTCCGGCGAGGCCCCCGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGGCGGAG 711 

llllllll I I I I I I I I I II II I I I I I II I II Ml II I II I I II 

4 15 CCTCCGGCCCGGCCTCCCCCTCCTCCCCCGGCCAGCGTGAGCCCCCAGGCAGAGCCCGTG 474 

712 CCCGCCGCGCCCCCTTCCACGCCGGCCGCGCCCAAGCGC 750 

MMIIIIIIIIM Mill MIIMI MUM II 

475 TGGACCCCGCCAGCCCCGGCTCCCGCCGCGCCCCCCTCCACCCCGGCCGCGCCCAAGCGC 534 
751 AGGGGCTCC GGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCT 807 

MIIIIMI MIIIIIIIMMIMIIIIIIIMIIIIIMIIIIIIIIIIIIIIII 

53 5 AGGGGCTCCTCGGGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCT 594 
8 08 GTGATACCCTCCTCTGCAGAAAAAATTATGGATTTGATGGAGCAGCCAGGTAACACTGTT 867 

lllllll MMMMMIMM llllll MM IIMMIIIIIIIIIIIM II 

5 95 GTGATACG CT C CT CTG CAGAAAA TATGGACTTGAAGGAGCAGCCAGGTAACACTATT 651 

8 68 TCGTCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCCTCTCTTCCTTCT 927 

III M MIMIMI IMIM MMMMIMMMM llllll I II II II II 1 1 1 

652 TCGGCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCTTCTCTTCCTTCT 711 
928 CTATCTCCTCTCTCAACTGTTTCTTTTAAAGAACATGGATACCTTGGTAACTTATCAGCA 987 

II MIMIMI | | mill IMIIIMM IIIIIIIIIIII II III II 

712 CTGTCTCCTCTCT(^GCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 771 



988 GTGTCATCCTCAGAAGGAACAATTGAAGAAACTTTAAATGAAGCTTCTAAAGAGTTGCCA 104 7 



II I I II I MINI II MINI I I I MINIMUM I II 

Db 772 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA. 831 

Qy 104 8 GAGAGGGCAACAAATCCATTTGTAAATAGAGATTTAG CAGAATTTT CAGAATTAGAATAT 1107 

1 1 1 1 Mill II II I I II Mlllllllll 1 1 1 1 Ml III M MM 

Db 832 GAGAAGGCAAAAACTCTACT CATAGATAGAGATTTAACAGAGTTTT CAGAATTAGAATAC 8 91 

Qy 1108 TCAGAAATGGGATCATCTTTTAAAGGCTCCCCAAAAGGAGAGTCAGCCATATTAGTAGAA 1167 

I MMMMIMM II I I 1 1 1 Ml III Ml II III II MUM I 

Db 8 92 TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 951 
Qy 1168 AACACTAAGGAAGAAGTAATTGTGAGGAGTAAA GACAAAGAGGATTTAGTTTGTAGT 1224 

II 111 MIMM MM MM I MM II Mill I III: III I 

Db 952 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1011 
Qy 1225 GCAGCCCTTCACAGTCCACAAGAATCACCT GTGGGTAAAGAAGAC 1269 

MIMI I II llllll I MM III llllll II 

Db 1012 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1071 

Qy 1270 AGAGTTGTGTCTCCAGAAAAGACAATGGACATTTTTAATGAAATGCAGATGTCAGTAGTA 132 9 

I III 1 1 Ml I MIMM Ml MM II I M I II II I I I MM I I 

Db 1072 GAAGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTGGAA 1131 

Qy 1330 GCACCTGTGAGGGAAGAGTATGCAGACTTTAAGCCATTTGAACAAGCATGGGAAGTGAAA 138 9 

II III MIMM II IMMIIMM II I Ml I II IMM Mill 

Db 1132 GCT CCTATGAGGGAGGAATATG CAGACTTCAAAC CATTTGAGCGAGTATGGGAAGTGAAA 1191 
Qy 1390 GATACTTATGAGGGAAGTAGGGATGTG CTGGCTG CTAGAG CT AATGTG 1437 

MM I Ml I III Ml II MM I III I II M 

Db 1192 GATA GTAAGGAAGATAGTGATATGTTGG CTG CTGGAGGTAAAATCGAGAGCAACTTG 1248 

Qy 1438 GAAAGTAAAGTGGACAGAAAATGCTTGGAAGATAGCCTGGAGCAAAAAAGTCTTGGGAAG 14 97 

1 1 1 1 1 1 II i 1 1 i ; I I llllll II I MINIMI MIMM llllll 

Db 124 9 GAAAGTAAAGTGGATAAAAAATGTTTTGCAGATAGCCTTGAGCAAACTAATCACGAAAAA 1308 

Qy 14 98 GATAGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAACCTGTGAAGGAC 1557 

MINIM I II Mill III MINIUM llllll llllll I I INN 

Db 1309 GATAGTGAGAGTAGTAATGATGATACTTCTTTCC CCAGTACG CCAGAAGGTATAAAGGAT 1368 
Qy 1558 AGCTCCAGAGCATATATTACCTGTGCTTCCTTTACCT CAGCAACCGAAAGCACCACA 1614 

I I MINIMI II llllll llllll I llllll II Nil II 

Db 1369 CGTCCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATTGCA 1428 

Qy 1615 GCAAACACTTTCCCTTTGTTAGAAGATCATACTTCAGAAAATAAAACAGATGAAAAAAAA 1674 

llllll III MINIUM INN NNNNNINN II NNININN 

Db 142 9 ACAAACATTTTTCCT12 5X6 GAGATCCTACTTCAGAAAATAAGACCGATGASEESfiKa 1488 
Qy 1675 ATAGAAGAAAGGAAGG C CCAAATTATAACAGAGAAG ACTAGCCCCAAAACGTCAAAT 1731 

IMMIIMI MIMMIMM MIMMIMI llllll MIMM Mill 

Db 148 9 AT AG AAG AAAAG AAGG C C CAAATAG TAA CAG AG AAG AAT A C TAG CA C CAAAA CAT CAAA C 154 8 

Qy 1732 CCTTTCCTTGTAGCAGTACAGGATTCTGAGGCAGATTATGTTACAACAGATACCTTATCA 1791 

Mill Ml MUM MMMMIMM I MIMM llllll Ml II 

Db 154 9 CCTTTTCTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAACAGATAATTTAACA 1608 

Qy 17 92 AAGGTGACTGAGGCAGCAGTGTCAAACATGCCTGAAGGTCTGACGCCAGATTTAGTTCAG 1851 

NNNNINN II III MMMIINIINN INN NINIINN III 



Db 1609 AAGGTGACTGAGGAAGTCGTGG CAAACATG C CTGAAGGCCTGACT C CAGATTTAGTACAG 1668 

Qy 18 52 GAAG CATGTGAAAGTGAACTGAATGAAG CCACAGGTACAAAGATTG CTTATGAAACAAAA 1911 

IIIIIIIII1IIMMM lllllllll II IIIIIIIIIIIMIIIIIIMIIIIII 

Db 1669 GAAG CATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTG CTTATGAAACAAAA 172 8 

Qy 1912 GTGGACTTGGTCCAAACATCAGAAGCTATACAAGAATCAG 1971 

MINI I lllllllllllll III Mill Mill II II IMIII 

Db 172 9 ATGGACTTGGTTCAAACATC^GAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAGC^^ 178 8 

Qy 1972 TGCCCATC^TTTGAGGAAGCTGAAGO^CTCCGTCACCAGTTTTGCCTGATATTGTTATG 2031 

IMIII II I II I Mill Mill MMMMMMIMM I Mill 

Db 178 9 TGCCCATC^TTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTTATG 184 8 

Qy 2 032 GAAGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTA 2 0 91 

I I I I I I : : I i Mill! MM II lllllllllll II I llllllll I 

Db 184 9 GAAGC^C(^TTGAATTCTG(^GTTCCTAGTGCTGGTGCTTCCGTGATAC^GCCC^GCTCA 1908 

Qy 2 092 TCCCCACTGGAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCTTGAGCCTGAAAAC 2151 

II III I MM II II IMIII IMIII II Mill I 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1909 TCACCATTAGAAG CTTCTTCAGTTAATTATGAAAGCATAAAACATGAGCCTGAAAAC 1965 

Qy 2152 CCCCCACCATATGAAGAAGCCATGAATGTAGCACT AAAAGCTTTGGGAACAAAGGAA 22 08 

MM MUM MM MM MM MM Mill I MM MM 

.Db 1966 CCCCCACCATATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATCAGGAATAAAGGAA 2 025 

Qy 22 09 GGAATAAAAGAGCCTGAAAGTTTTAATGCAG CTGTTCAGGAAACAGAAG CT CCTTATATA 22 68 

I III MM MUM I M II Ml MM II 1 1 II 1 1 1 1 1 II I II 1 1 1 II 

Db 2 026 GAAATTAAAGAG CCTGAAAATATTAATG CAG CT CTTCAAGAAACAGAAGCT CCTTATATA 2 085 

Qy 2269 TCt^TTGCGTGTGATTTAATTAAAGAAAO^GCTCTCCACTGAGCCAAGTCCAGATTTC 2328 

II Mill MIMIMIII MUM Mill II MM III III IMIII 

Db 208 6 TCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGATTTC 214 5 

Qy 232 9 TCTAATTATTCAGAAATAG CAAAATT CGAGAAGTCGGTG CC CGAACACGCTGAGCTAGTG 2388 

III lllllllllllll IMIII III II I Mill II II IIIIIIIIII 

Db 214 6 TCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGCTAGTT 22 05 

Qy 238 9 GAGGATTCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAA 244 8 

II II Mill I II MM MIMIMMMMM Mill III IMIII II II Mill 

Db 22 06 GAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCTGAC 2265 
Qy 244 9 GTCCCACAAACACAAGAGGAGGCTGTGATGCTCATGAAGGAGAGTCTCACTGA A 2502 

II IIIIIII IMIII II IIIMIIIII MM II lllllllllll I 

Db 2266 GTTCC^CAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTC^CTGAGACTTCA 2325 

Qy' 25 03 GTGTCTGAGACAGTAG CC CAG CACAAAGAGGAGAGACTTAGTG CCT CACCT CAGGAG CTA 2562 

I llllllll MM I Ml Mill I II I III I 

Db 2326 TTTGAGT CAATGATAGAATATGAAAATAAGGAAAAACTCAGTG CTTTG C CACCTGAGGGA 23 85 
Qy 2563 GGAAAGCCATATTTAGAGTCTTTTCAGCCCAATTTAC^TAGTACAAAAGATGC TGCA 2619 

J 1 1 1 1 1 1 J I r 1 1 f J M IMIII Ml II MM Ml lllllllll I I 

Db 2386 GGAAAGCCATATTTGGAATCTTTTAAGCTC^GTTTAGATAACACAAAAGATACCCTGTTA 244 5 

Qy 262 0 TCTAATGACATTCCAACATTGACCAAAAAGGAGAAAATTTCTTTGCAAATGGAAGAGTTT 267 9 

II MM II IMIMIII I MIMM li IM IIIIIII Mill III I 

Db 2446 CCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCAGATGGAGGAGCTC 25 05 



Qy 268 0 AATACTGCAATTTATTCAAATGATGACTTACTTTCTTCTAAGGAAGACAAAATAAAAGAA 273 9 

I lllllll llllllllllllllllllll II lllllllllll I I I I I I I I I 

Db 2506 AGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATAAGAGAA 2565 

Qy 274 0 AGTGAAACATTTTCAGATTCATCTCCGATTGAGATAATAGATGAATTTCCCACGTTTGTC 2799 

I llllll lillMlllllllilil Mill II llllllll II II II II II 

Db 2566 ACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACATTGATC 2625 

Qy 28 00 AGTGCTAAAGATGATTC T CCTAAATTAG C CAAGGAGTACACTGATCTAGAAGTAT C C 2 8 56 

III Mill MUM I MINIMI: Ml II Mill IIMI Ml 

Db 2 626 AGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAAGTATCC 2 685 

Qy 2857 GACAAAAGTGAAATTGCTAATATCCAAAGCGGGGCAGATTCATTGCCTTGCTTAGAATTG 2 916 

1 1 III Ml Mill lllllll II II II I 1 i 1 1 1 1 1 1 ] i I i llllll I 

Db 2 686 CACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACAGAATTG 274 5 

Qy 2 917 CCCTGTGACCTTTCTTTCAAGAATATATATCCTAAAGATGAAG TACATGTTTCA 2 970 

III IMMMIMM Mill Ml I II MM Mill I I I III 

Db 2746 CCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGTTTCTCA 2 8 05 

Qy 2 971 GATGAATTCTCCGAAAATAGGTCCAGTGTATCTAAGGCATCCATATCGCCTTCAAATGTC 3 03 0 

Mill II II IIMI MM I III MM I III MM II MM 

Db 28 06 GATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCAGATGTT 2865 

Qy 3 031 T CTG CTTTGGAACCT CAGACAGAAATGGG CAG CATAGTTAAATCCAAAT CACTTACGAAA 3 090 

MM MM MM MM II I MINIM Mill III MM 

Db 2 8 66 T CTG CTTTGG CCACT CAAG CAGAGATAGAGAG CATAGTTAAACCCAAAGTT CTTGTGAAA 2 92 5 

Qy 3 091 GAAGCAGAGAAAAAACTTCCTTCTGACACAGAGAAAGAGGACAGATCCCTGTCAGCTGTA 3150 

Mill MM M IMIII II IIMI II II 1 : 1 1 1 1 I I II III II 

Db 2 92 6 GAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCTGCTATA 2 985 

Qy 3151 TTGTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAG 3210 

II MMMMMMMMMMMMMMMMMIMI IIMI IMIII III 

Db 298 6 TTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGACATTAAG 304 5 

Qy 3211 AAGACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGC 3270 

MMMMMMMMMMMIMM Mill MM II lllllll llllll 

Db 3 04 6 AAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTATT{ZAGC 3105 

Qy 3271 ATTGTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGG 333 0 

Mill II IIMI Ml II llllll MMM MM Mill IMMMIMM 

Db 3106 ATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGCTTTAGG 3165 

Qy 3331 ATATATAAGGG CGTGAT CCAGG CTATCCAGAAATCAGATGAAGGCCACC CATTCAGGG CA 33 90 

Mill Mill llllllll ! i 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 ! 1 1 ! I ! 1 1 i 1 1 ! 1 1 1 

Db 3166 ATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGGCA 3225 

Qy 33 91 TATTTAGAATCTGAAGTTGCTATATCAGAGGAATTGGTTCAGAAATACAGTAATTCTGCT 3450 

III I llllllllllllllllllll Mill 1 1 1 1 1 ! 1 1 1 1 1 IIIIIIIIIIMIM 

Db 322 6 TATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAATTCTGCT 3285 

Qy 3451 CTTGGTCATGTGAACAGCACAATAAAAGAACTGAGGCGGCTTTTCTTAGTTGATGATTTA 3510 

IMIIIIIIIIIIM MM Mill Mill Mill II II I II I M I II 1 1 1 II 1 1 

Db 328 6 CTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGATGATTTA 334 5 



Qy 


3511 


GTTGATTCCCTGAAGTTTGOVGTGTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTC 

MINIM 1 II 1 1 1 1 1 1 i 1 1 1 1 M 1 1 1 1 1 II 1 1 1 lllll II Mi 1 1 1 Ml 1 1 II 

GTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTT 


3570 


Db 


3346 


3405 


Qy 


3571 


AATGGTCTGACACTACTGATTTTAGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTAT 

IIIIIIIIIIIMIIIIIIIIII lllll II MM'! II II MINI Mill 

AATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTTATTTAT 


3630 


Db 


3406 


3465 


Qy 


3631 


GAACGG CAT CAGGTG CAGATAGATCATTATCTAGGACTTG CAAACAAGAGTGTTAAGGAT 

MIMIIIIIIII III llllll MINI IIIIIMM: MM MMM III 

GAACGG CAT CAGGCG CAGATAGATCATTATCTAGGACTTG CAAATAAGAATGTTAAAGAT 


3690 


Db 


3466 


3525 


Qy 


3691 


G C CATGGCCAAAAT C CAAG CAAAAATCCCTGGATTGAAG CGCAAAG CAGA 374 0 

II lllll : i II 

GCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGA 3575 




Db 


3526 





RESULT 11 
ABK90134 

ID ABK90134 standard; DNA; 3579 BP. 
XX 

AC ABK9 0134; 
XX 

DT 21-OCT-2002 (first entry) 
XX 

DE DNA encoding human NogoA protein. 
XX 

KW Human; Nogo; BACE; acute neuronal injury; spinal injury; head injury; 
KW stroke; peripheral nerve damage; neoplastic disorder; glioblastoma; 
KW neuroblastoma; hyperprol iterative disorder; dysprol iterative disorder; 
KW cirrhosis; psoriasis; keloid formation; fibrocystic condition; cancer; 
KW tissue hypertrophy; central nervous system; axon regeneration; NogoA; 
KW Nogo-associated disease; metastasis; gene; ds . 
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 
FT CDS 1 . . 3579 

FT /*tag= a 

FT /product = "Human NogoA protein" 

XX 

PN WO200257483-A2 . 
XX 

PD 25-JUL-2002. 
XX 

PF 18-JAN-2002; 2002WO-GB00228 . 
XX 

PR 18-JAN-2001; 2001GB-0001312 . 
XX 

PA (GLAX ) GLAXO GROUP LTD. 

PA {SMIK ) SMITHKLINE BEECHAM PLC. 

XX 

PI Blackstock WP, Hale RS, Prinjha R, Rowley A; 
XX 

DR WPI; 2002-599722/64. 
DR P-PSDB; ABG30938. 
XX 



PT Identifying modulators of Nogo or BACE activity for treating acute 

PT neuronal injuries, neoplastic or dysprol iterative disorders, comprises 

PT providing and monitoring interaction between Nogo and BACE polypeptides 

PT 

XX 

PS Disclosure; Page 53-58; 68pp; English. 
XX 

CC The present invention relates to a new method of identifying modulators 

CC of Nogo function or BACE activity. The method involves providing Nogo and 

CC BACE polypeptides capable of binding with each other, monitoring the 

CC interaction between these polypeptides, and determining if the test agent 

CC is a modulator of Nogo or BACE activity. The method is useful in treating 

CC acute neuronal injuries, such as spinal or head injury, stroke, 

CC peripheral nerve damage, and in neoplastic (e.g. glioblastomas, 

CC neuroblastomas) , hyperprol iterative or dysprol iterative disor ers (e.g. 9 

CC cirrhosis, psoriasis, keloid formation, fibrocystic conditions, tissue 

CC hypertrophy) of the central nervous system. The BACE polypeptide is 

CC useful in screening methods to identify agents that may act as modulators 

CC of BACE activity and in particular agents that may be useful in treating 

CC Nogo-associated diseases. The modulators of Nogo or BACE polypeptides, 

CC and the polynucleotide encoding the BACE polypeptide are useful in 

CC manufacturing a medicament for the treatment or prevention of disorders 

CC responsive to the modulation of Nogo activity, in alleviating the 

CC symptoms or improving the condition of a patient suffering from this 

CC disorder, in axon regeneration, or in preventing metastasis or spreading 

CC of a cancer. The polynucleotide may also be an essential component in 

CC assays, a probe, in recombinant protein synthesis., and in gene therapy 

CC techniques. The present nucleic acid sequence encodes the human NogoA 

CC protein of the invention. 

XX 

SQ Sequence 3579 BP; 1074 A; 803 C; 812 G; 890 T; 0 other; 

Query Match 61.2%; Score 2289.2; DB 24; Length 3579; 

Best Local Similarity 81.5%; Pred. No. 0; 

Matches 2925; Conservative 0; Mismatches 548; Indels 117; Gaps 19; 
Qy 253 ATGGAAGACATAGACCAGTCGTCGCTGGTCTCCTCGTCCACGGACAGCCCGCCCCGGCCT 312 

MINIMI I Mill I Mill MM MMMMM MMMM 

Db 1 ATGGAAGACCTGGACCAGTCTCCTCTGGT CTCGTCCTCGGACAGCCCACCCCGGCCG 57 

Qy 313 CCG C CCGC CTTCAAGTAC CAGTTCGTGACGGAG CC CGAGGACGAGGAGGACGAGGAGGAG 372 

I MMM IIIIIIIIIIIIMIMII 1 1 1 1 1 M 1 1 1 1 1 1 11 1 1 M M 1 1 Ml 

Db 58 CAGCCCGCGTTCAAGTACCAGTTCGTGAGGGAGCCCGAGGACGAGGAG GAAGAAGAG 114 

Qy 373 GAGGAGGACGAGGAGGAGGACGACGAGGACCTAGAGGAACTGGAGGTGCTGGAGAGGAAG 432 

MMMM Mill III II Mill Mill Mill I II I II 1 1 1 1 M II I II I II I 

Db 115 GAGGAGGAAGAGGAGGACGAGGACGAAGAC CTGGAGGAG CTGGAGGTGCTGGAGAGGAAG 174 
Qy 433 CCCGCAGCCGGGCTGTCCGCAGCTGCGGTGC CGCCCGCCGCCGCCGCGCCGCTG 486 

Mill ; 1 : 1 1 1 1 1 1 : 1 1 1 II I MM I II MMM MMM Ml 

Db 17 5 CCCGCCGCCGGGCTGTCCGCGGCCCCAGTGCCCACCGCCCCTGCCGCCGGCGCGCCCCTG 234 

Qy 4 87 CTGGACTTCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGCCGGCCGCGCCC 54 6 

MINIM I I MM MMI M MMMM M M MMMMMI Ml 

Db 235 ATGGACTTCGGAAATGACTTCGTGCCGCCGGCGCCCCGGGGACCCCTGCCGGCCGCTCCC 294 
Qy 547 CCTGCCGCTCCTGAGAGGCAGCCATCCTGGGAACGCAGCCCCG CGGCGCCC 597 



II I III II Ml IIIIIII II Mill I MMI I I I MM 

Db 295 CCCGTCGCCCCGGAGCGGCAGCCGTCTTGGGACCCGAGCCCGGTGTCGTCGACCGTGCCC 354 

Qy 598 GCGCCATCCCTGCCGCCCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAGAGGACGACGAG 657 

Illlllllll II I I IIIIIIIIIIII I II I M 1 1 1 1 1 II 1 1 MMMIIMM 

Db 355 GCGCCATCCCCGCTGTCTGCTGCCGCAGTCTCGCCCTCCAAGCTCCCTGAGGACGACGAG 414 

Qy 658 CCTCCGGCGAGGCCCCCGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGGCGGAG 711 

1 1 1 1 1 f 1 1 MM II Mill M II III Ml II II III III 

Db 415 CCTCCGGCCCGGCCTCCCCCTCCTCCCCCGGCCAGCGTGAGCCCCCAGGCAGAGCCCGTG 474 

Qy 712 CCCGCCGCGCGCCCTTCCACGCCGGCCGCGCCCAAGCGC 750 

IIIIIIIIIIIMI Mill I ! I I I : i I I 

Db 4 75 TGGACCCCGCCAGCCCCGGCTCCCGCCGCGCCCCCCTCCACCCCGGCCGCGCCCAAGCGC 534 

Qy 751 AGGGGCTCC GGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCT 807 

lllllllll MIMMMMMIMMMMMMMMMIMMIMMMMM 

Db 53 5 AGGGGCTCCTCGGGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCT 594 

Qy 8 08 GTGATACCCTCCTCTGCAGAAAAAATTATGGATTTGATGGAGCAGCCAGGTAACACTGTT 867 

IIIIIII IMIMMMIMM MUM MM 1 1 i 1 1 [ I M [ 1 1 1 1 1 1 1 ! I II 

Db 595 GTGATACGCTCCTCTGCAGAAAA TATGGACTTGAAGGAG CAGC CAGGTAACACTATT 651 

Qy 8 68 TCGTCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCCTCTCTTCCTTCT 927 

Ml III llllll IIIMNII MINI llllll MMM IMIIIIMIII 

Db 652 TCGGCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCTTCTCTTCCTTCT 711 

Qy 928 CTATCTCCTCTCTCAACTGTTTCTTTTAAAGAACATGGATACCTTGGTAACTTATCAGCA 987 

II MMMIIMM I I IMIII INI Ml IIIIIIIIIIII M III II 

Db 712 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 771 

Qy 98 8 GTGTCATCCTCAGAAGGAACAATTGAAGAAACTTTAAATGAAGCTTCTAAAGAGTTGCCA 104 7 

II I I II I lllllllll II llllll I I I 1 1 1 1 1 1 1 1 1 1 i ! M : I II 

Db 772 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 831 

Qy 1048 GAGAGGGCAACAAATCCATTTGTAAATAGAGATTTAGCAGAATTTTCAGAATTAGAATAT 1107 

MM Mill II II I I II Illlllllll MM IIIIIIIIIIIIIII 

Db 832 GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 891 

Qy 1108 TCAGAAATGGGATCATCTTTTAAAGGCTCCCCAAAAGGAGAGTCAGCCATATTAGTAGAA 1167 

llllll MINI II I I III IIIIIII III II III M llllll I 

Db 892 TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 951 

Qy 1168 AACACTAAGGAAGAAGTAATTGTGAGGAGTAAA GACAAAGAGGATTTAGTTTGTAGT 1224 

II Ml Mill II MM MM Mill II I II II I 1 1 1 1 1 1 I II I 

Db 952 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1011 

Qy 122 5 GCAGCCCTTCACAGTCCACAAGAATCACCT GTGGGTAAAGAAGAC 12 69 

MUM I II llllll I MM III llllll II 

Db 1012 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1071 

Qy 127 0 AGAGTTGTGTCTCCAGAAAAGACAATGGACATTTTTAATGAAATGCAGATGTCAGTAGTA 132 9 

Illlllllll Mill Ml MM MINIM I I MM I I 

Db 1072 GAAGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTGGAA 1131 

Qy 133 0 G CACCTGTGAGGGAAGAGTATG CAGACTTTAAG CCATTTGAACAAG CATGGGAAGTGAAA 138 9 

M III MM II MIMIIMM II II III I II MIMIMIMM 



Db 



1132 GCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTGAAA 1191 



Qy 13 90 GATACTTATGAGGGAAGTAGGGATGTG CTGG CTG CTAGAGCT AATGTG 1437 

MM I Ml I Ml III II MIMMI III I 1 1 II 

Db 1192 GATA GTAAGGAAGATAGTGATATGTTGG CTG CTGGAGGTAAAAT CGAGAG CAACTTG 1248 

Qy 1438 GAAAGTAAAGTGGACAGAAAATGCTTGGAAGATAGCCTGGAGCAAAAAAGTCTTGGGAAG 14 97 

1 1 i 1 1 1 1 1 1 1 1 1 1 1 I llllll II I IMIM 1 1 II 1 1 1 I II I II 

Db 124 9 GAAAGTAAAGTGGATAAAAAATGTTTTGCAGATAGCCTTGAGCAAACTAATCACGAAAAA 1308 

Qy 14 98 GATAGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAACCTGTGAAGGAC 1557 

Illlllll I M Mill III IIIIIIIIIIIIIMI MIMI I I Mill 

Db 13 09 GATAGTGAGAGTAGTAATGATGATACTTCTTTC CCCAGTACG CCAGAAGGTATAAAGGAT 1368 

Qy 1558 AGCTCCAGAGCATATATTACCTGTGCTTCCTTTACCT CAGCAACCGAAAGCACCACA 1614 

I I I I I I I I I I I I I I I I I I I I M I M I I I II I I I I I I M l4T4T4t 3599 

Db 136 9 CGTCCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATTGCA 1428 

Qy 1615 GCAAACACTTTCCCTTTGTTAGAAGATC^TACTTC^GAAAATAAAACAGATGAAAAAAAA 1674 

llllll III MMMIIM Mill 1 1 1 1 1 1 1 1 1 ! 1 1! 1 1 II IIIIIMIIIII 

Db 142 9 ACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAAAAA 148 8 
Qy 167 5 ATAGAAGAAAGGAAGG CCCAAATTATAACAGAGAAG ACTAGCCCCAAAACGTCAAAT 1731 

MMMIIM MINI! i MMIMIIII llllll lllllll Mill 

Db 14 89 ATAGAAGAAAAGAAGG C C CAAATAG TAACAGAGAAGAATACTAG CA C CAAAA CATCAAA C 1548 

Qy 1732 CCTTTCCTTGTAGCAGTACAGGATTCTGAGGCAGATTATGTTACAACAGATACCTTATCA 1791 

Mill IIIIIMIM M II III M MIMIMM MMMIIM Ml II 

Db 154 9 CCTTTTCTTGTAGCAGCAC^GGATTCTGAGACAGATTATGTCACAACAGATAATTTAACA 1608 

Qy 1792 AAGGTGACTGAGGCAGCAGTGTCAAAGATGCCTGAAGGTCTGACGCCAGATTTAGTTCAG 1851 

MINIMI! II III IN II MINIM Mill MMIMIIII III 

Db 1609 AAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTACAG 1668 

Qy 1852 GAAGCATGTGAAAGTGAACTGAATGAAGCCACAGGTACAAAGATTGCTTATGAAACAAAA 1911 

I lllllll MIMIMM Ml llllll M II 1 1 II M I M 1 1 1 M I M 1 1 II I M I 

Db 1669 GAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAACAAAA 1728 

Qy 1912 GTGGACTTGGTCCAAACATCAGAAGCTATACAAGAATC^CTTTACCCCACAGCACAGCTT 1971 

MMMIIM MIMMMMM III Mill Mill II II IN INN 

Db 1729 ATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAGCTT 178 8 

Qy 1972 TGCCCATCATTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATG 2031 

Illlllll Mill II I Mill Mill lllllll Mill MM MINIMI 

Db 1789 TGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTTATG 184 8 

Qy 2 032 GAAGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTA 2 091 

I I 1 1 1 : 1 1 1 1 llllll MM II MMIMIIII II I Illlllll I 

Db 184 9 GAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGCTCA 1908 

Qy 2 092 TCCCCACTGGAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCTTGAGCCTGAAAAC 2151 

II III I INI II II llllll llllll II Mill I lllllllllllll 

Db 1909 TCACCATTAGAAG CTTCTT CAGTTAATTATGAAAG CATAAAACATGAG CCTGAAAAC 1965 

Qy 2152 CCCCCACCATATGAAGAAGCCATGAATGTAGCACT AAAAG CTTTGGGAACAAAGGAA 22 08 

I llllll NUN lllllll MM MM Mill I MM lllllll 

Db 1966 CCCCCACCATATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATCAGGAATAAAGGAA 2 025 



Qy 2209 GGAATAAAAGAG C CTGAAAGTTTTAATGCAGCTGTTCAGGAAACAGAAG CTCCTTATATA 22 68 

I III Mill 1 I lllllllllll 1 1 1 1 II I I 

Db 2 026 GAAATTAAAGAG C CTGAAAATATTAATGCAGCTCTT CAAGAAACAGAAGCTCCTTATATA 2085 

Qy 22 69 TCCATTGCGTGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGCCAAGTCCAGATTTC 2328 

II Mill IIIIIIMMIIIIIIIIIIIIMM II 1 1 1 1 III III Mil 

Db 2 08 6 TCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGATTTC 2145 

Qy 232 9 TCTAATTATTCAGAAATAGCAAAATTCGAGAAGTCGGTGCCCGAACACGCTGAGCTAGTG 23 88 

III IIIMIIIIIMI MINI III II I Mill II 1 1 IMIIMIM 

Db 2146 TCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGCTAGTT 22 05 

Qy 238 9 GAGGATTCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAA 244 8 

II I MINIMI MM Mill MMM IMIMM 1 1 Mill 

Db 2206 GAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCTGAC 2265 
Qy 2449 GTCCCACAAACACAAGAGGAGGCTGTGATGCTCATGAAGGAGAGTCTCACTGA A 2502 

II MIMM MIMI 1 1 MINIUM MM 1 1 lllllllllll I 

Db 22 66 GTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACTTCA 2325 

Qy 2503 GTGTCTGAGACAGTAG CC CAG CACAAA.GAGGAGAGACTTAGTG CCTCACCTCAGGAGCTA 2562 

I I III I I II MM I III Mill I II I Ml I 

Db 232 6 TTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTTGCCACCTGAGGGA 23 85 
Qy 2 563 GGAAAGCCATATTTAGAGTCTTTTCAGCCCAATTTACATAGTACAAAAGATGC TGCA 2619 

1 1 1 1 1 1 ! I 1 : 1 1 II MINI III II Nil III Mill I I 

Db 238 6 GGAAAG CCATATTTGGAATCTTTTAAG CT CAGTTTAGATAACACAAAAGATAC CCTGTTA 24 4 5 

Qy 2 62 0 TCTAATGAC^TTCCAAC7\TTGACCAAAAAGGAGAAAATTTCTTTGCAAATGGAAGAGTTT 2679 

M MM II lllllllll I MMM Mill MIMM Mill III I 

Db 244 6 CCTGATGAAGTTTCAACATTGAG CAAAAAGGAGAAAATTCCTTTG CAGATGGAGGAG CTC 25 05 

Qy 2 680 AATACTGCAATTTATTCAAATGATGACTTACTTTCTTCTAAGGAAGACAAAATAAAAGAA 273 9 

I MMM Mill II III li Ml II IM IMM I MM MM 

Db 2506 AGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATAAGAGAA 2565 

Qy 274 0 AGTGAAACATTTTCAGATTCATCTCCGATTGAGATAATAGATGAATTTCCCACGTTTGTC 2799 

I MINI ' I I Mill II III II II II II II 

Db 2566 ACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACATTGATC 2 625 
Qy 28 00 AGTGCTAAAGATGATTC T C CTAAATTAGC CAAGGAGTACACTGATCTAGAAGTATCC 2856 

III lllll MINI I IMIMM Ml II 1 1 II I MIMM II 

Db 2626 AGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAAGTATCC 2685 

Qy 2857 GACAAAAGTGAAATTG CTAATAT CCAAAG CGGGGCAGATT CATTG C CTTG CTTAGAATTG 2 916 

M II I! I I || lllll III MIMM II MIMM 

Db 2 686 CACAAAAGTGAAATTG CTAATG CCCCGGATGGAG CTGGGTCATTG CCTTG CACAGAATTG 2745 
Qy 2 917 CCCTGTGACCTTTCTTTCAAGAATATATATCCTAAAGATGAAG TACATGTTTCA 2 970 

III 1 1 1 1 1 1 1 1 . INN Ml I II MM lllll I I I III 

Db 274 6 CCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGTTTCTCA 28 05 

Qy 2 971 GATGAATTCTCCGAAAATAGGTCCAGTGTATCTAAGGCATCCATATCGCCTTCAAATGTC 303 0 

lllll M II lllll Mil I Ml MM I III Nil II INI 

Db 28 06 GATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCAGATGTT 28 65 



Qy 3031 TCTG CTTTGGAAC CT CAGACAGAAATGGG CAG CATAGTTAAATC CAAATCACTTACGAAA 3090 

i 1 1 1 1 1 1 1 1 1 1 I IIIIIMIIIII Mill III MM 

Db 2866 TCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTTGTGAAA 2 925 

Qy 3 0 91 GAAG CAGAGAAAAAACTTCCTTCTGACACAGAGAAAGAGGACAGAT C CCTGTCAG CTGTA 3150 

Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M Mill IMIIMIMMM I II III II 

Db 2 926 GAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCTGCTATA 2 985 

Qy 3151 TTGTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAG 3210 

II IIIIIMIIIII li MrilMII II MIM II IIIIIMIIIII:! 

Db 2 986 TTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGACATTAAG 3045 

Qy 3211 AAGACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGC 327 0 

IIIIIIIIIIIIIIIIIIIIIMMII MMIMMIMI II lllllll IMIM 

Db 3046 AAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTATTCAGC 3105 

Qy 3271 ATTGTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGG 333 0 

MIM II Mill MINIM MM, lllllll Mill ill, M Mil 

Db 3106 ATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGCTTTAGG 3165 

Qy 3331 ATATATAAGGGCGTGATCCAGGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGGCA 3390 

MM Mill IIIIMM 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 : 1 1 i 1 1 1 1 1 1 : > I I I I 

Db 3166 ATATACAAGGGTGTGATC CAAGCTATC CAGAAAT CAGATGAAGG CCACCCATTCAGGG CA 322 5 

Qy 33 91 TATTTAGAATCTGAAGTTGCTATATCAGAGGAATTGGTTCAGAAATACAGTAATTCTGCT 34 5 0 

III I MMMMMMMMIMI Mill IIIIIIIIIII 

Db 3226 TATCTGGAAT CTGAAGTTG CTATAT CTGAGGAGTTGGTTCAGAAGTACAGTAATTCTG CT 3285 

Qy 3451 CTTGGTCATGTGAACAGCACAATAAAAGAACTGAGGCGGCTTTTCTTAGTTGATGATTTA 3510 

Mill MM! MM Mill Mill Mill II 1 1 1 II 1 1 1 1 1 M II II M 

Db 3286 CTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGATGATTTA 334 5 

Qy 3511 GTTGATTCCCTGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTC 357 0 

IIIIMM MIMMMIMMIMMMMMI Mill MM Mill IM 

Db 3346 GTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTT 34 05 

Qy 3 571 AATGGTCTGACACTACTGATTTTAGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTAT 3 63 0 

IIIIIMIIIII II MM Mill II 1 1 1 1 : 1 1 1 1 1 1 Ml Mill II 

Db 34 06 AATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTTATTTAT 3465 

Qy 3 631 GAACGGCATCAGGTGCAGATAGATCATTATCTAGGACTTGCAAACAAGAGTGTTAAGGAT 369 0 

MMIMMIMI II II MIM II II II II MM II I MM IMIM III 

Db 34 66 GAACGGCATCAGGCGCAGATAGATCATTATCTAGGACTTGCAAATAAGAATGTTAAAGAT 352 5 

Qy 3691 GCCATGGCCAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCAGA 374 0 

II Mill IM Illlllll MIM MMMMIMM II 

Db 3526 GCTATGGCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGA 3575 



RESULT 12 
AAD01174 

ID AAD01174 Standard; CDNA; 3833 BP. 
XX 

AC AAD01174; 
XX 

DT 02-NOV-2000 (first entry) 
XX 



DE Bovine neurit e growth inhibitor Nogo cDNA. 
XX 

KW Bovine; neurite growth inhibitor; Nogo; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma,- 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperprol iterative disorder; benign dysprol iterative disorder; diagnosis ; 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 

KW structural plasticity; screening; ss. 

XX 

OS Bos sp. 
XX 

PN WO200031235-A2 . 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US2616 0 . 
XX 

PR 06-NOV-1998; 98US-0107446 . 
XX 

PA (SCHW/) SCHWAB ME. 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids useful for treating eoplastic 5 

PT disorders of the central nervous system and inducing regeneration of 

PT neurons - 
XX 

PS Claim 26; Fig 12; 122pp; English. 
XX 

CC The present sequence is a cDNA encoding bovine Nogo protein which is a 

CC potent neural cell growth inhibitor and is free of all central nervous 

CC system (CNS) myelin material with which it is natively associated. The 

CC present sequence was obtained from bovine spinal cord white matter cDNA 

CC library. Nogo proteins and fragments displaying neurite growth inhibitory 

CC activity are used in the treatment of neoplastic disease of the CNS 

CC e.g. glioma, glioblastoma, medulloblastoma , craniopharyngioma, ependyoma, 

CC pinealoma, haemangioblastoma , acoustic neuroma, oligodendroglioma, 

CC menagioma, neuroblastoma or retinoblastoma and degenerative nerve 

CC diseases e.g. Alzheimer's and Parkinson's diseases. Therapeutics which 

CC promo Nogo activity can be used to treat or prevent hyperprolffif Srative 

CC or benign dysprol iterative disorders e.g. psoriasis and tissue 

CC hypertrophy. Ribozymes or antisense Nogo nucleic acids can be used to 

CC inhibit production of Nogo protein to induce regeneration of neurons or 

CC to promote structural plasticity of the CNS in disorders where neurite 

CC growth, regeneration or maintenance are deficient or desired. 

CC The animal models can be used in diagnostic and screening methods for 

CC predisposition to disorders and to screen for or test molecules which 

CC can treat or prevent disorders or diseases of the CNS . 

CC Note: SEQ ID numbers 35-42 are referred in claim 32 and SEQ ID NO: 29 

CC in disclosure of the specification. However the specification does not 

CC include sequences for these SEQ ID numbers. 

XX 

SQ Sequence 3833 BP; 1235 A; 717 C; 818 G; 1063 T; 0 other; 



Query Match 50.0%; Score 1869.8; DB 21; Length 3833; 

Best Local Similarity 80.9%; Pred. No. 0; 

Matches 2320; Conservative 0; Mismatches 492; Indels 55; Gaps 10; 



Qy 928 CTATCTCCTCTCTCAACTGTTTCTTTTAAAGAACATGGATACCTTGGTAACTTATCAGCA 987 

MINI Mill I I I MINIM! II MINIUM I III Mill 

Db 1 CTATCTCCTCTCTCAGCCGCTGCTTTTAAAGAACGTGAATACCTTGGTGATTTACCAGCA 60 

Qy 98 8 GTGTCATCCTCAGAAGGAACAATTGAAGAAACTTTAAATGAAGCTTCTAAAGAGTTGCCA 104 7 

II II I IMIMIII II II Mill MMIMIMMIMM II M 

Db 61 GTACTGCCCACTGAAGGAACACTTCCAGCAACTTCAAATGAAGCTTCTAAAGCATTCTCA 120 

Qy 1048 GAGAGGGCAACAAAT C CATTTGTAAATAGAGATTTAG CAGAATTTT CAGAATTAGAATAT 1107 

MM Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 I III Mill 1 1 1 1 Mill 1 . 1 1 1 MINI 

Db 121 GAGAAGGCAAAAAATCCATTTGTAGAGAGAAATTTAACAGAATTTTCAGAATTGGAATAT 18 0 

Qy 1108 TCAGAAATGGGAT CATCTTTTAAAGG CT C C CCAAAAGGAGAGT CAG CCATATTAGTAGAA 1167 

IMIIIMM MINI II I Mill I III I III I III II Mill 

Db 181 TCAGAAATGGAATCATC^TTCAGTGGCTCTCAAAAGGCAGAACCTGCCGTAACAGTAGCG 24 0 
Qy 1168 AACACTAAGGAAGAAGTAATTGTGAGGAGTAAAGACA AAGAGGATTTAGTTTGTAGT 1224 

II Ml III Ml II Mill MM III I Mill! Ml I I 

Db 241 AATCCTAGGGACGAAATAGTTGTGAGGAGTAGAGATAAAGAAGAGGACTTAGTTAGTCTT 300 
Qy 122 5 GCAGCCCTTCACAGTCCACAAGAATCACCT GTGGGTAAAGAA 1266 

llllll III II II I I II M I Mill 

Db 301 AACAT CCTT CATACT CAG CAGGAGTTAT CTACAGT C CTTACGAAAT CAGTTGAAGAAGAA 360 

Qy 1267 GACAGAGTTGTGTCTCCAGAAAAGACAATGGACATTTTTAATGAAATGCAGATGTCAGTA 132 6 

II llllll IMMM III MM MM! Ml MM I I III I 

Db 361 GATAGAGTTCTGTCTCCAGAAAAAAOWVGGACAGTTTTAAGGAAAAGGGAGTTG(^G(^ 42 0 

Qy 1327 GTAGCACCTGTGAGGGAAGAGTATGCAGACTTTAAGCCATTTGAACAAGCATGGGAAGTG 138 6 

I Ml II II MM II IMMM M IIIIMM I II IMMM 

Db 421 GAAGCTTCTATGGGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTG 48 0 
Qy 138 7 AAAGATACTTATGAGGGAAGTAGGGATGTGCTGGCTGCTAGAGCTAAT 1434 

llllll I: II I Ml Mill II MM III MM 

Db 481 AAAGATACTTACAAG CAAGATAGTGATGTTTTGATTG CTGGAGGTAATATAGAGAG CAAA 54 0 

Qy 1435 GTGGAAAGTAAAGTGGACAGAAAATGCTTGGAAGATAGCCTGGAGCAAAAAAGTCTTGGG 14 94 

Mill IMMM I Ml Ml 1 1 1 1 1 1 i 1 1 II MM II II II 

Db 541 TTGGAAGGTAAAGTGGATAAGAAACACTTTTCAGATAGCCTTGAACAAACAAATCGTGAA 600 

Qy 14 95 AAGGATAGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAACCTGTGAAG 1554 

II IMMM MM Mill II MM II 11 1 1 It i E llllll MM I 

Db 601 AAAGATAGTGAAAGCAGTAATGATGACACTTCATTTCCC^GTACACCAGAAGCTGTAAGA 660 

Qy 1555 GACAGCTCCAGAGCATATATTACCTGTGCTTCCTTTACCTCAGCAACCGAAAGCACCACA 1614 

I I III MM M II II MUM I II Ml I II MM II I II 

Db 661 GGTGGTTCCGGAGCGTA(^TCACGTGTGCTCCCTTTAACCCAA(^yVCTGAGAATGTTTCA 72 0 

Qy 1615 G CAAACACTTT C CCTTTGTTAGAAGATCATACT 1674 

llllll III II Mill MIIMIMIMM IIIIMM IIMIIIIIMIII 

Db 721 ACAAACATTTTTCCCTTGTTGGAAGATCATACTTCGGAAAATAAGACAGATGAAAAAAAG 78 0 



Qy 1675 ATAGAAGAAAGGAAGGCCCAAATTATAACAGAGAAGA CTAGCCCCAAAACGTCAAAT 1731 

MINI III Mill Illlll llllllllllll I II III II Mill 

Db 781 ATAGAA - AAAAAAAGGCACAAATTGTAACAGAGAAGAATG CAAGTGT CAAGACATCAAAC 83 9 

Qy 1732 C CTTT CCTTGTAG CAGTACAGGATT CTGAGGCAGATTATGTTACAACAGATAC CTTATCA 1791 

Illllllll I MM MMM III M MMMI ! I i 1 1 ! i 1 1 1 1 I III 

Db 84 0 CCTTTCCTTATGGCAGCACAGGAGTCTAAGACAGATTACGTTACAACAGATCATGTGTCA 8 99 

Qy 17 92 AAGGTGACTGAGGCAGCAGTGTCAAACATGCCTGAAGGTCTGACGCCAGATTTAGTTCAG 18 51 

IMIMM MM II MM 1 1 II M 1 1 II 1 1 M II 1 1 1 II II III MMM 

Db 900 AAGGTGACCGAGGAAGTAGTGGCAAACATGCCTGAAGGTCTAACCCCAGATTTGGTTCAG 959 

Qy 1852 GAAG CATGTGAAAGTGAACTGAATGAAG CCACAGGTACAAAGATTG CTTATGAAACAAAA 1911 

II II I I I I I II I I II I II I I I I I II II I II Illlllll I II I I I I I I II I I I II 

Db 960 GAAG CATGTGAAAGTGAATTGAATGAAG CTACTGGTACAAAAATTG CCTTTGAAACAAA^ 1019 

Qy 1912 GTGGACTTGGTCCAAAC^TCAGAAGCTATACAAGAATCACTTTACCCCACAGCACAGCTT 1971 

Mill MM Mill IIIIIIIM I II II lllllllllll I IMIMM 

Db 102 0 ATGGACCTGGTTCAAACTTCAGAAGCTGTGCAGGAGTCACTTTACCGTGTAACACAGCTT 1079 

Qy 1972 TGCCCATCATTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATG 2 031 

IMIMM Mill III MMMI lllllllllll Illlll: Mill III 

Db 108 0 TGCCCATCTTTTGAAGAATCTGAAGCTACTCCGTCACCGGTTTTGCCTGACATTGTCATG 113 9 

Qy 2 032 GAAGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTA 2 091 

1 1 III Ml Mil II II II II II 1 1 1 M 1 1 M 1 1 IMIMM II 1 1 I 

Db 114 0 GAAGCACCATTAAATTCTGTAGTTCCTAGTGCTGGTGCTTCTGCAGTGCAGCTCAGTTCA 1199 

Qy 2 092 TCCGCACTGGAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCTTGAGCCTGAAAAC 2151 

II III I Ml I I MM MMM Illlll II Illlll lie MttG 13 

Db 12 00 TCACCATTAGAAACTCTTCCTTCAGTTAATTATGAAAGCATAAAGTTTGAGCCTGAAAAT 12 59 
Qy 2152 CCCC CAC CATATGAAGAAG CCATGAATGTAGCACT AAAAGCTTTGGGAACAAAGGAA 22 08 

IMMMMIMM II MIMMMMI 1 1 1 1 Mill I MM II III 

Db 12 60 CCCC CAC CATATGAGGAGG C CATGAATGTATCACTAAAAAAAGAAT CAGGAATGAATGAA 1319 

Qy 22 09 GGAATAAAAGAG CCTGAAAGTTTTAATG CAGCTGTTCAGGAAACAGAAG CTCCTTATATA 22 68 

I III I MINIM II III II Mill MINIMI Illlll MMM 

Db 132 0 GAAAT CACAGAG CCTGAAGGTATTAGTGTAG CTGTT CAGGAAACAGAAG CTC CTTATATA 13 79 

Qy 2269 TCCATTGCGTGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGCC^GTCC^GATTTC 2328 

II Mill IIIIIIMIIIIIIMIIIMIII MM Mill II I I MINN 

Db 138 0 TCTATTGCATGTGATTTAATTAAAGAAACAAAGATCTCTACTGAACCGACTCCAGATTTC 143 9 

Qy 232 9 TCTAATTATTCAGAAATAG CAAAATT CGAGAAGTCGGTG CCCGAACACG CTGAGCTAGTG 23 8 8 

I I I I I II I II I I II M Ml I II I Illlllll II Illllllll 791 
Db 144 0 TCTAGTTATTCAGAAATAG CAGAAGTTG CACAG CCAGTG CCCGAG CATTCTGAGCTAGTT 14 99 

Qy 23 8 9 GAGGATTCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAA 244 8 

II IMIMM II M ! 1 1 1 1 1 1 1 : 1 ; ! Ill 1 1 1 J i J 1 1 1 1 1 1 1 II II Ml 

Db 1500 GAAGATTCCTCCCCCGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCCGAA 1559 

Qy 244 9 GT CC CACAAACACAAGAGG AGG CTGTG ATG CTCATGAAGGAGAGT CT CACTGAAGTGT C - 25 07 

II MMMI MMM II MM M M MM II I IIIIIIIM I II 

Db 1560 GTTCCACAAAAACAAGATGAAG CTGTAATACTTGTGAAAGAAAAC CTCACTGAAATTT CA 1619 
Qy 2508 TGAGACAGTAGCCCAGCACAAAGAGGAGAGACTTAGTGCCTCAC CTCAGGAG 2 559 



Db 162 0 TCTGAGTCAATGACAGGAC^TGACA^ 1679 
Qy 2 560 CTAGGAAAGCCATATTTAGAGT CTTTTCAG C C CAATTTACATAGTACAAAAGATG C T 2616 

III II Mill I I I : ; ] I I ■ I I I I I I I I I I lllllllll I 

Db 1680 GGAGGAAAACCGTATTTGGAGT CTTTTCAG CCCAGTTTAGG CAT CACAAAAGATACCTTA 1739 

Qy 2 617 GCATCTAATGACATTCCAACATTGACCAAAAAG 2 676 

lllllllll II 1,1 llllllll II I II I II III II I MINI Mill III 

Db 174 0 GCACCTGATGAAGTTTCAGC^TTGACCCAAAAGGAGAAAATCCCTTTGCAGATGGAGGAG 17 99 

Qy 2 677 TTTAATACTGCAATTTATTCAAATGATGACTTACTTTCTTCTAAGGAAGACAAAATAAAA 2736 

I lllllllll lllllllll Mill MM I I II MUM II III I 

Db 18 00 CTCAATACTGCAGTTTATTCAAGTGATGGCTTATTCATTGCTCAGGAAGCAAACCTAAGA 18 59 

Qy 2737 GAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATAATAGATGAATTTCCCACGTTT 27 96 

1 1 1 1 1 1 1 1 M 1 1 1 i 1 1 1 1 1 1 1 1 1 I ! 1 1 1 1 1 1 1 1 1 1 1 MMMM M || || Ml 

Db 18 60 GAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATTATAGATGAGTTCCCGACCTTT 1919 
Qy 2797 GTCAGTGCTAA AGATGATTCTCCTAAATTAGCCAAGGAGTACACTGATCTAGAAGTA 28 53 

MUM MM MM llllllll llllllll III llllllll lllllllll 

Db 192 0 GTCAGTTCTAAAGCAGATTCTTCTCCTACATTAGCCAGGGAATACACTGACCTAGAAGTA 1979 

Qy 2854 TCCGACAAAAGTG7VAATTGCTAATATCCAAAGCGGGGCAGATTCATTGCCTTGCTTAGAA 2 913 

M IIIIIIIIIIIMIIII I Mill M II I IMMI MM II I 

Db 198 0 GCCC^OUVAAGTGAAATTGCTGAC^TCCAGGATGGAGCTGGGTCATTGGCTTGTGCAGGA 2 03 9 

Qy 2 914 TTGCCCTGTGACCTTTCTTTCAAGAATATATATCCTAAAGATGAAGTACATGTTTCAGAT 2 973 

IMMI I I !l I I MM I llllllll Mill Mill Mill 

Db 2 04 0 TTGCCCCA.TGACCTTTCTTTCAAGAGTATACAACCTAAAGAGGAAGTTCATGTCCCAGAT 2 099 

Qy 2 974 GAATTCTCCGAAAATAGGTCCAGTGTATCTAAGGCATCCATATCGCCTTCAAATGTCTCT 3 033 

II IMMI II Mill III II MM II II MM II MM III 

Db 2100 GAGTTCTCCAAAGATAGGGGTGATGTTTCAAAGGTGCCCGTACTGCCTCCAGATGTTTCT 2159 

Qy 3034 GCTTTGGAACCTCAGACAGAAATGGGCAGCATAGTTAAATCCAAATCACTTACGAAAGAA 3 0 93 

MMMM MM MM II llllllllll III Mill III IMIMI 

Db 2160 GCTTTGGATGCTCAAGCAGAGATAGGCAGCATAGAAAAACCC^WVGTTCTTGTGAAAGAA 2219 

Qy 3 094 GCAGAGAAAAAACTTCCTTCTGACACAGAGAAAGAGGACAGATCCCTGTCAGCTGTATTG 3153 

II MM IMMMMMIMI Mill MUM III I M III MM 

Db 222 0 GCCGAGAGAAAACTT CCTTCTGATACAGAAAAAGAG CGAAGAT CTC CATCTG CTATATTT 2279 

Qy 3154 TCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAG 3213 

Ml MINI II illlil llllll Mllll II IIIIMMIIMMI Ml 

Db 228 0 T CAG CAGAG CTGAGTAAAACTTCAGTTGTTGAC CT C CTCTACTGGAGAGACATTAAGAAG 233 9 

Qy 3214 ACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGCATT 3273 

I I L i, . I II I I MIIMIMM II llllllll lllllllll 

Db 234 0 ACTGGAGTGGTGTTTGGTGCCAGCTTGTTCCTGCTGCTCTCGCTGACAGTATTCAGCATT 23 99 

Qy 3274 GTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGGATA 3333 

II MMMIMMMMMMMIMMIMIIMM MM II IMMI Mill 

Db 24 0 0 GTGAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATA 24 59 

Qy 3334 TATAAGGG CGTGAT CCAGG CTATC CAGAAATCAGATGAAGG CCACCCATTCAGGG CATAT 33 93 

MMMM M IMMI IMMI MMM I II M M II M II 1 1 1 1 II I II II II I 



Db 24 60 TATAAGGGTGTGATCCAGGCTATCCAGAAATCTGATGAAGGCCACCCATTCAGGGCATAT 2519 

Qy 3394 TTAGAATCTGAAGTTGCTATATCAGAGGAATTGGTTCAGAAATACAGTAATTCTGCTCTT 3453 

II IIIIIIIIIIIMIIIIIII Mill Nl II IN Mill Mill; 1 1 1 1 1 

Db 2 520 TTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTT 2 579 

Qy 3454 GGTCATGTGAACAGCACAATAAAAGAACTGAGGCGGCTTTTCTTAGTTGATGATTTAGTT 3 513 

MM Ml MMMIMMIMM M II M I II I II II 1 1 M M 1 1 II M I 

Db 258 0 GGTCATGTTAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTT 263 9 

Qy 3514 GATTCCCTGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTCAAT 3 573 

Mill llllllllllllllllllllllllll Mill MMMMMIMM Mill 

Db 2 64 0 GATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAAT 2 69 9 

Qy 3 574 GGTCTGACACTACTGATTTTAGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTATGAA 3 633 

! ill I l^l^m^lll II llllllll 1 1 Ml I INI 1 1 III III Mill 1 1 II 1 1 

Db 2 700 GGTCTGACACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAA 275 9 

Qy 3 634 CGGCATCAGGTGCAGATAGATCATTATCTAGGACTTGCAAACAAGAGTGTTAAGGATGCC 3 693 

IIIMIM Ml 1 1 1 : 1 1 : 1 1 1 MMIMIMI MM MM Mill 

Db 2760 CGG CATCAGGCG CAAATAGATCATTAT CTGGGACTTGCAAATAAGAATGTTAAAGATG CT 2 819 

Qy 3 694 ATGG CCAAAAT CCAAG CAAAAAT C CCTGGATTGAAG CGCAAAG CAGA 374 0 

Mill IIIIIMIIIIIIIIIIIIIIIIIIMIIIII Mill II 

Db 282 0 ATGG CTAAAAT CCAAG CAAAAAT C CCTGGATTGAAG CGTAAAGCTGA 2 866 

RESULT 13 
AAV30920 



ID AAV30920 standard; cDNA; 2386 BP. 
XX 

AC AAV3 092 0; 
XX 

DT 14-SEP-1998 (first entry) 
XX 

DE Human secreted protein BG160_1 cDNA. 
XX 

KW BG160_1; secreted protein; protein factor; human; ds . 
XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 
FT CDS 102..275X| 
FT /*tag= a 

FT sig_peptide 1863.. 1899 

FT /*tag= b 

FT /note= "putative leader/signal peptide" 

FT mat_peptide 1900.. 2027 

FT /*tag= c 

XX 

PN W09817687-A2 . 
XX 

PD 30-APR-1998. 
XX 

PF 24-OCT-1997; 97WO-US19590 . 
XX 

PR 24-OCT-1997; 97US - 074 0274 . 



PR 25-OCT-1996; 96US-0740274 . 
XX 

PA (GEMY ) GENETICS INST INC. 
XX 

PI Agostino MJ, Jacobs K, Lavallie ER, McCoy JM, Merberg D; 

PI Racie LA, Spaulding V, Treacy M; 

XX 

DR WPI; 1998-261426/23. 

DR P-PSDB; AAW58383 . 
XX 

PT Nucleic acid encoding secreted protein from human cells - useful, 

PT e.g. as immunomodulator , antitumour agent, promoters of tissue 

PT growth, haemostatic and thrombolytic agents etc. 
XX 

PS Claim 20; Page 74-75; 114pp ; English. 
XX 

CC This cDNA clone, designated BG160__1, codes for a novel human 

CC secreted protein (see AAW58383) . It was isolated from a human adult 

CC brain cDNA library using methods selective for cDNAs that encode 

CC secreted proteins. The clone is deposited in composite clone 

CC ATCC 98232; an oligonucleotide (see AAT99725) is designed to isolate 

CC the clone from the composite. The predicted AT415_4 amino acid 

CC sequence shows homology to neuroendocrine- specif ic proteins. Novel 

CC cDNA clones (see AAV30916-32) coding for human secreted proteins (see 

CC AAW58580-90) are claimed. These can be used for recombinant 

CC production of the secreted proteins for analysis, characterisation, 

CC diagnostic or therapeutic use. They can also be used as tissue or 

CC mol.wt. markers, for chromosome identification, to identify genetic 

CC disorders, to isolate new related DNA, as sources of primers for 

CC PCR, to generate antibodies, and in interaction trap assays. The 

CC secreted proteins may also have many biological activities, e.g. 

CC cytokine, immunomodulator, haematopoiesis regulating activity, 

CC tissue growth activity, activin or inhibin activity, chemotactic or 

CC chemokinetic activity, haemostatic and thrombolytic activity, 

CC receptor/1 igand activity, antiinflammatory, cadherin and tumour 

CC invasion suppressor activity, and tumour inhibition activity. The 

CC proteins can be expressed in vivo from DNA, introduced in gene 

CC therapy vectors . 

XX 

SQ Sequence 2386 BP; 756 A; 450 C; 494 G; 686 T; 0 other; 

Query Match 37.7%; Score 1411.2; DB 19; Length 2386; 

Best Local Similarity 83.3%; Pred. No. 1.4e-289; 

Matches 1702; Conservative 0; Mismatches 303; Indels 39; Gaps 7; 



Qy 


1718 


CCAAAACGTCAAATCCTTTCCTTGTAGCAGTACAGGATTCTGAGGCAGATTATGTTACAA 

Illllll Mill Mill llllllllll IMMMMIMI MINI MM 

CCAAAACATCAAACCCTTTTCTTGTAGCAGCACAGG 


1777 


Db 


1 


60 


Qy 


1778 


CAGATACCTTATCAAAGGTGACTGAGGCAGCAGTGTCAAACATGCCTGAAGGTCTGACGC 

IIMM Ml Mill Mill II Ml llllllll lllli IMM 1 

CAGATAATTTAACTW^GGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTC 


1837 


Db 


61 


120 


Qy 


1838 


CAGATTTAGTTCAGGAAGC^TGTGAAAGTGAACTGAATGAAGCCACAGGTACmAGATTG 

llllllllll llllll MUM MM Illllllll II IMM Mill 

CAGATTTAGTACAGGAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTG 


1897 


Db 


121 


180 



Qy 1898 CTTATGAAAC^lAAAGTGGACTTGGTCCAAACATCAGAAGCTATACAAGAATCACTTTACC 1957 

llllllllllllll lllllllll II llllll Mill Ml M M I MMI 1 1 I 

Db 181 CTTATGAAACAAAAATGGACTTGGTT 24 0 

Qy 1958 CCACAGCACAGCTTTGCCCATCATTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGC 2 017 

I MMMMMMMMMMMMI 1 1 I IMM MMI 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 241 CTGCAGCACAGCTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGC 3 00 

Qy 2 018 CTGATATTGTTATGGAAGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAG 2 077 

1 1 1 1 MIMMIMI Mill llllll I I I I II Illllllllll II 

Db 3 01 CTGACATTGTTATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGA 360 

Qy 2 078 TGCAGCCCAGTGTATCCCCACTGGAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGC 2137 

I llllllll III III I 1 1 1 1 II II llllll MINI II Mill I 

Db 3 61 TACAGC CTCATCACCCTTAGAAG CTTCTTCAGTTAATTATGAAAGCATAAAAC 417 

Qy 213 8 TTGAG CCTGAAAAC CCC CCACCATATGAAGAAGC CATGAATGTAG CACT AAAAGCTT 2194 

M, ill llllli II llllll Mill MM 1 1 1 1 1 1 1 1 Mill I 

Db 418 ATGAG CCTGAAAACCCC CCACCATATGAAGAGGC CATGAGTGTATCACTAAAAAAAGTAT 4 77 

Qy 2195 TGGGAACAAAGGAAGGAATAAAAGAG CCTGAAAGTTTTAATG CAG CTGTTCAGGAAACAG 2254 

MM llllllll III lllllllllllll I IIMIIIMII MM I M 1 1 1 1 

Db 478 CAGGAATAAAGGAAGAAATTAAAGAG CCTGAAAATATTAATG CAG CTCTTCAAGAAACAG 537 

Qy 2255 AAGCTCCTTATATATCCATTGCGTGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGC 2314 

MIIIIIIIIIIMII Mill IMIMIIIMIIIIIIIIIIIIIM II MM I 

Db 538 AAGCTCCTTATATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAAC 597 

Qy 2315 CAAGTCCAGATTTCTCTAATTATTCAGAAATAGCAAAATTCGAGAAGTCGGTGCCCGAAC 2374 

I I IvMMttl I lllllllllllll llllll I || V || I 

Db 5 98 CAGCTCCGGATTTCTCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCGTGATC 657 

Qy 23 75 ACGCTGAGCTAGTGGAGGATTCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATG 2434 

I 1 1 1 1 1 1 1 1 : 1 II 1 1 1 1 M 1 1 M 1 1 1 1 lllllllll III IIMIIIMII MM 

Db 658 ATTCTGAGCTAGTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATG 717 

Qy 2435 ATTCGATTCCTGAAGTCCCACAAACACAAGAGGAGGCTGTGATGCTCATGAAGGAGAGTG 24 94 

MM II Mill II MIIMI llllll II IIIMIMM MM II MM 

Db 718 ATTCAATACCTGACGTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTC 777 
Qy 24 95 TCACTGA AGTGT CTGAGACAGTAGC CCAG CACAAAGAGGAGAGACTTAGTG CCT 2 548 

MIMM II llllllll MM I III Mill I 

Db 778 TCACTGAGACTTCATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTT 83 7 

Qy 2 54 9 CACCT CAGGAG CTAGGAAAGCCATATTTAGAGT CTTTTCAG CCCAATTTACATAGTACAA 26 08 

II I Ml Mill II MUM Ml || MM Ml MM 

Db 838 TGCCACCTGAGGGAGGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAA 8 97 
Qy 26 09 AAGATGC TGCATCTAATGACATTCCAACATTGACCAAAAAGGAGAAAATTTCTTTGC 2665 

Mill I I II MM II lllllllll Mill IIIIIMM, llllll 

Db 8 98 AAGATACCCTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGC 957 

Qy 2666 AAATGGAAGAGTTTAATACTGCAATTTATTCAAATGATGACTTACTTTCTTCTAAGGAAG 2725 

I Mill III I I MIIMI 1111:11 II Illllllllll 

Db 958 AGATGGAGGAGCTCAGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAG 1017 

Qy 2726 ACAAAATAAAAGAAAGTGAAACATTTTCAGATT CATCT CCGATTGAGATAATAGATGAAT 278 5 



Db 1018 CACAGATAAGAGAAACTGAAACGTTTTCAGATT CATCT CCAATTGAAATTATAGATGAGT 1077 
Qy 278 6 TTCCCACGTTTGTCAGTGCTAAAGATGATTC T CCTAAATTAG C CAAGGAGTACACTG 2842 

I II II II MM! Mill llllll ! MMMMIMI Ml II MM 

Db 1078 TCCCTACATTGATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTG 1137 

Qy 2843 ATCTAGAAGTAT CCGACAAAAGTGAAATTGCTAATATCCAAAG CGGGG CAGATTCATTGC 2902 

I MMMMIMI II MINI Illlllll II M II I MM 

Db 113 8 ACCTAGAAGTAT CC CACAAAAGTGAAATTGCTAATG CCCCGGATGGAG CTGGGTCATTG C 1197 
Qy 2 903 CTTG CTTAGAATTG CCCTGTGAC CTTTCTTT CAAGAATATATATC CTAAAGATGAAG 2 959 

Mill MINIM MMMMIMI Mill Ml I II MM Mill 

Db 1198 CTTGCACAGAATTGCCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGA 12 57 
Qy 2 96 0 TACATGTTTCAGATGAATTCTCCGAAAATAGGTCCAGTGTATCTAAGGCATCCATAT 3 016 

I I I I'M M II Mill MM I III MM I III 

Db 1258 AAATCAGTTTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTAT 1317 

Qy 3017 CGCCTTCAAATGTCTCTGCTTTGGAACCTCAGACAGAAATGGGCAGCATAGTTAAATCCA 3076 

MM II MM MMIIIMI MM MM II I MMMMIMI III 

Db 1318 TGCCTCCAGATGTTTCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCA 13 77 

Qy 3 077 AATCACTTACGAAAGAAG CAGAGAAAAAACTTC CTTCTGACACAGAGAAAGAGGACAGAT 3136 

II III IMIMIII MMMMMMIMM II Mill MM Mill 

Db 1378 AAGTT CTTGTGAAAGAAG CTGAGAAAAAACTTCCTT CCGATACAGAAAAAGAGGACAGAT 1437 

Qy 3137 CCCTGTCAGCTGTATTGTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACT 3196 

I I II III MM MMMMMMMMMMMMMMIMMIMM MM 

Db 143 8 C@C(CBTH0SCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGT5OXE GTACT 14 97 

Qy 3197 GGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTC 32 56 

MiMMIII MIMI N MIMI MIMI Mill IMMMMMM II 

Db 14 98 GGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTG CCAG CCTATTCCTG CTG CTTT CAT 1557 

Qy 3257 TGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGA 3316 

Mill II Mill II Mill MMMMMMMMMIMIMM MM 

Db 1558 TGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGA 1617 

Qy 3317 CTATCAG CTTTAGGATATATAAGGG CGTGATC CAGGCTATCCAGAAATCAGATGAAGG CC 33 76 

I 1 1 1 1 : I 1 1 1 1 . 1 1 1 ' Mill MIMI IIIIIMI MIM; Mill 

Db 1618 CCAT CAG CTTTAGGATATACAAGGGTGTGATC CAAG CTATC CAGAAAT CAGATGAAGG CC 1677 

Qy 3377 ACCCATTCAGGGCATATTTAGAATCTGAAGTTGCTATATCAGAGGAATTGGTTCAGAAAT 3436 

Ml MM M Illlllllllllll Mill MM MIM I 

Db 1678 ACCCATTCAGG GAAGTTG CTATATCTGAGGAGTTGGTTGAGAAGCG 3722 

Qy 3437 ACAGTAATTCTGCTCTTGGTCATGTGAACAGCACAATAAAAGAACTGAGGCGGCTTTTCT 34 96 

MMMMMMMMMMMMIMM MM Mill Mill Mill II MM 

Db 1723 ACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCT 1782 

Qy 34 97 TAGTTGATGATTTAGTTGATTCCCTGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATG 3556 

llllll! M MIIIMIIII II Mill llilllllllllllll Mill Mill MM 

Db 1783 TAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATG 1842 

Qy 3557 TTGGTGCCTTGTTCAATGGTCTGACACTACTGATTTTAGCTCTGATCTCACTCTTCAGTA 3616 

llllll lllllll III II 1 1 1 1 1 1 1 MM I Ml 1 1 Mill II MMMMIMI 



Db 



1843 TTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTG 19 02 



Qy 



3617 TTCCTGTTATTTATGAACGGCATCAGGTGCAGATAGATCATTATCTAGGACTTGCAAACA 3676 



Db 




Qy 



3677 AGAGTGTTAAGGATG CCATGG CCAAAAT C CAAG CAAAAATC CCTGGATTGAAG CGCAAAG 3 73 6 



Db 




Qy 



3737 CAGA 3740 



Db 



2023 CTGA 2026 



RESULT 14 
AAF98399 

ID AAF98399 standard; cDNA; 2386 BP. 
XX 

AC AAF98399; 
XX 

DT 07-JUN-2001 (first entry) 
XX 

DE Human cDNA clone BG160_1 sequence SEQ ID 41. 
XX 

KW Human; secreted protein; nutrient; cytokine modulator; proliferation; 

KW differentiation; immune system modulator; tissue growth; chemotactic; 

KW haemostatic; thrombolytic; anti - inflammatory ; tumour inhibition; ss; 

KW haematopoiesis . 
XX 

OS Homo sapiens. 
XX 

PN WO200119988-A1. 
XX 

PD 22-MAR-2001. 

XX 

PF 14-SEP-2000; 2000WO-US25135 . 
XX 

PR 17-SEP-1999; 99US-0398829 . 
XX 

PA (GEMY ) GENETICS INST INC. 
XX 

PI Jacobs K, McCoy JM, LaVallie ER, Collins-Racie LA, Evans C; 

PI Merberg D, Treacy M, Bowman MR, Spaulding V, Agostino MJ; 
XX 

DR WPI; 2001-244801/25. 

DR P-PSDB; AAB90682. 
XX 

PT Isolated nucleic acids encoding polypeptides, useful for modulating 

PT e.g. cytokine and cell proliferation/differentiation activity, the 

PT immune system and hematopoiesis regulating activity - 
XX 

PS Claim 1; Page 408-409; 557pp; English. 
XX 

CC Human cDNA clones represented in AAF98374 - AAF98489 encode secreted 

CC proteins AAB90667 - AAB90750. The cDNA clones are isolated from various 

CC tissue types, and may be used in the prevention, treatment and diagnosis 



CC of diseases associated with inappropriate protein expression. The 

CC polypeptides and nucleic acids may be used as nutrients or to modulate 

CC cytokine and cell proliferation/differentiation activity and may also be 

CC involved in modulation of the immune system. The cDNA sequences, 

CC proteins, their agonists and/or antagonists exhibit haematopoiesis 

CC regulating activity; tissue growth activity; activin/inhibin activity; 

CC chemotactic/chemokinetic activity; haemostatic and thrombolytic 

CC activity; receptor/ ligand activity; ant i- inflammatory activity; 

CC haematopoiesis activity; cadherin/tumour suppressor activity; and/or 

CC tumour inhibition activity. Included in the invention are probes 

CC represented in AAF98490 - AAF98572 which are specific for the cDNA clones 

CC encoding the secreted proteins. 

XX 

SQ Sequence 2386 BP; 756 A; 448 C; 496 G; 686 T; 0 other; 

Query Match 37.6%; Score 14 08; DB 22; Length 23 86; 

Best Local Similarity 83.2%; Pred. No. 6.5e-289; 

Matches 1700; Conservative 0; Mismatches 305; Indels 39; Gaps 7; 
Qy 1718 COVAAACGTCAAATCCTTTCCTTGTAGCAGTACAGGATT^ 1777 

lllllll Mill Mill III lllllllllllll II MM 

Db 1 CCAAAACATCAAACCCTTTTCTTGTAGCAGCAC^ 60 

Qy 1778 CAGATACCTTATCAAAGGTGACTGAGGCTIGC^ 1837 

IIIIII III Ml IIIIII II II III II M Milll li Mill I 

Db 61 CAGATAATTTAACAAAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTC 12 0 

Qy 1838 CAGATTTAGTT CAGGAAG CATGTGAAAGTGAACTGAATGAAG CCACAGGTACAAAGATTG 18 97 

llllllllll ,1 ! I l IIIMIMI II II:. 

Db 121 CAGATTTAGTACAGGAAG(^TGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTG 18 0 

Qy 18 98 CTTATGAAACAAAAGTGGACTTGGT C CAAACATCAGAAG CTATACAAGAATCACTTTAC C 1957 

IIIIMMIIIIII llllllllll lllllllllllll III Mill Mill II I 

Db 181 CTTATGAAACAAAAATGGACTTGGTTCAAAC^TCAGAAG 24 0 

Qy 1958 CCACAGCACAGCTTTGCCCATC^T 2017 

I IIIIIMIIIIIIIIMMIMIII II I Mill Mill IIIIIMIIIIII 

Db 241 CTGCAG(^CAGCTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGC 30 0 

Qy 2 018 CTGATATTGTTATGGAAGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAG 2 077 

MM illllli:. IIIIII M MUM MM II lllllll II 

Db 3 01 CTGACATTGTTATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGA 360 

Qy 2 078 TGCAGCCCAGTGTATCCCCACTGGAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGC 213 7 

I llllllll III III I MM II II IIIIII 1 1 1 II t II Mill I 

Db 361 TACAGCCCAGCTCATCACCA AGAAG CTTCTTCAGTTAATTATGAAAGCAtEAAAAC 417 

Qy 213 8 TTGAGCCTGAAAACCCCCCACCATATGAAGAAGCCATGAATGTAGCACT AAAAGCTT 2194 

II 1 1 1 II 1 1 II I II I II II I II 1 1 1 M 1 1 1 lllllll MM MM Mill I 

Db 418 ATGAG CCTGAAAACCC C CCACCATATGAAGAGG CCATGAGTGTATCACTAAAAAAAGTAT 477 

Qy 2195 TGGGAACAAAGGAAGGAATAAAAGAG CCTGAAAGTTTTAATG CAGCTGTT CAGGAAACAG 22 54 

1 1 1 1 llllllll III MM;. Mill I lllllllllll MM MIIMI 

Db 478 CAGGAATAAAGGAAGAAATTAAAGAG C CTGAAAATATTAATG CAGCT CTTCAAGAAACAG 537 

Qy 2255 AAGCTCCTTATATATCCATTGCGTGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGC 2314 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill MMMMMMIMMMMIMM II MM I 



Db 53 8 AAG CT C CTTATATATCTATTG CATGTGATTTAATTAAAGAAACAAAG CTTTCTG CTGAAC 597 

Qy 2315 CAAGTCCAGATTTCTCTAATTATTCAGAAATAGCAAAATTCGAGAAGTCGGTGCCCGAAC 2374 

II III Mill : , : 1 1 1 1 1 1 : 1 1 1 Illlll III Ml Mill || | 

Db 598 CAGCTCCGGATTTCTCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATC 657 

Qy 2375 ACGCTGAGCTAGTGGAGGATTCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATG 2434 

I MINIMI I! MMMI MM 1 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 II II II 1 1 1 1 

Db 658 ATTCTGAGCTAGTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATG 717 

Qy 243 5 ATTCGATTCCTGAAGTCCCACAAACACAAGAGGAGGCTGTGATGCTCATGAAGGAGAGTC 24 94 

1 1 1 1 II Mill II MMMI Illlll II IMIMIMI 1 1 1 1 II MM 

Db 718 ATTCAATACCTGACGTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTC 777 
Qy 24 95 TCACTGA AGTGT CTGAGACAGTAG C CCAGCACAAAGAGGAGAGACTTAGTG CCT 2548 

MMMI II I III I I II 1 1 1 1 I Ml Mill I 

Db 778 TCACTGAGACTTCATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTT 83 7 

Qy 254 9 CACCTCAGGAGCTAGGAAAGCCATATTTAGAGTCTTTTCAGCCCAATTTACATAGTACAA 26 08 

II I III MINIMUM II Mllll Ml II MM III MM 

Db 838 TGCCACCTGAGGGAGGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAA 897 
Qy 2609 AAGATGC TGCATCTAATGACATTCCAACATTGACCAAAAAGGAGAAAATTTCTTTGC 2665 

Mill i I II MM II Mill! MINIMUM Illlll 

Db 898 AAGATACCCTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGC 957 

Qy 2666 AAATGGAAGAGTTTAATACTGCAATTTATTCAAATGATGACTTACTTTCTTCTAAGGAAG 2725 

I Mill III I I MMMI I i 1 1 1 1 1 1 1 1 1 1 { I f 1 1 1 1 1 II I MMM 

Db 958 AGATGGAGGAG CT CAGTACTG CAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAG 1017 

Qy 272 6 ACAAAATAAAAGAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATAATAGATGAAT 2785 

I MM Mill Illlll MMMMIIMIMM Mill II MMM I 

Db 1018 C^CAGATAAGAGAAACTGAAACGTTTTCTVGATTCATCTCCAATTGAAATTATAGATGAGT 1077 

Qy 2 78 6 TTCCCACGTTTGTCAGTGCTAAAGATGATTC- - -TCCTAAATTAGCCAAGGAGTACACTG 2842 

I II II II Mill Mill MMM I INI II I III II MM 

Db 107 8 TCCCTACATTGATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTG 1137 

Qy 2843 ATCTAGAAGTATCCGACAAAAGTGAAATTGCTAATATCCAAAGCGGGGCAGATTCATTGC 29 02 

I MMIMIMM II MIMi MINIM II II II I MMMI 

Db 1138 ACCTAGAAGTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGC 1197 
Qy 2 903 CTTGCTTAGAATTGCCCTGTGACCTTTCTTTCAAGAATATATATCCTAAAGATGAAG 2959 

Mill llllllllll MIMMMMI Mill Ml I II MM Mill 

Db 1198 CTTGCACAGAATTGCCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGA 12 57 
Qy 2 960 TACATGTTTCAGATGAATTCTCCGAAAATAGGTCCAGTGTATCTAAGGCATCCATAT 3016 

I I I Ml N II II Mill MM I Ml MM I III 

Db 1258 AAATCAGTTTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTAT 1317 

Qy 3017 CGCCTTCAAATGTCTCTGCTTTGGAACCTCAGACAGAAATGGGCAGCATAGTTAAATCCA 3076 

MM M MM llllllllll MM MM II I Nil 1 1 1 N III 

Db 1318 TGCCTCCAGATGTTTCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCA 13 77 

Qy 3 077 AATCACTTACGAAAGAAGCAGAGAAAAAACTTCCTTCTGACACAGAGAAAGAGGACAGAT 3136 

II III MINIMI IIIIIIIMII Illlll II Mill MIMMIMIM 

Db 1378 AAGTTCTTGTGAAAGAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGAT 1437 



Qy 3137 CCCTGTCAGCTGTATTGTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACT 3196 

I I II III MM MIMMMMIMMMMMMMMIMMIMM MM 

Db 1438 CACCATCTGCTATATTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACT 14 97 

Qy 3197 GGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTC 3256 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 MM MINI M 

Db 14 98 GGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCAT 1557 

Qy 32 57 TGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGA 3316 

MM 1 1 1 1 1 1 1 1 1 1 1 II MM Mill Illllllll MMM MM 

Db 1558 TGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGA 1617 

Qy 3317 CTATCAGCTTTAGGATATATAAGGGCGTGATCCAGGCTATCCAGAAATCAGATGAAGGCC 33 76 

I II MMM Mill Mill MMM 1 1 II I II II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 

Db 1618 CCATCAGCTTTAGGATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCC 1677 

Qy 3377 AC CCATTCAGGG CATATTTAGAATCTGAAGTTG CTATATCAGAGGAATTGGTT CAGAAAT 343 6 

MM MM MMIMMMIM Mill 1 1 1 1 1 i 1 1 1 1 1 I 

Db 1678 ACCCATTCAGG GAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGT 1722 

Qy 3437 ACAGTAATTCTGCTCTTGGTCATGTGAACAGCACAATAAAAGAACTGAGGCGGCTTTTCT 34 96 

1 1 1 1 1 f 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 MM Mill Mill Mill II MM 

Db 1723 ACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCT 1782 

Qy 34 97 TAGTTGATGATTTAGTTGATTCCCTGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATG 3556 

IN: Mill MMM M MMM M Mill MM 

Db 1783 TAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATG 1842 

Qy 3557 TTGGTGCCTTGTTCAATGGTCTGACACTACTGATTTTAGCTCTGATCTCACTCTTCAGTA 3616 

Mill III Mill Ml Ml II II I III Ml MMM Mill |t g\\Wg i848 

Db 1843 TTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTG 1902 

Qy 3617 TT CCTGTTATTTATGAACGG CAT CAGGTG CAGATAGATCATTATCTAGGACTTG CAAACA 3676 

II MMM MIIIIMIMM II 1 1 1 1 1 1 1 1 II 1 1 II I II 1 1 1 1 II II II I 

Db 1903 TTGGTGTTATTTATGAACGG CAT CAGG CA CAGATAGATCATTATCTAGGACTTG CAAATA 1962 

Qy 3677 AGAGTGTTAAGGATGCCATGGCCAAAATCCAAGCAAAAATCCCTC 3736 

III MMM Mill Mill Mill MMM II IMMM Mill II Ml 

Db 1963 AGAATGTTAAAGATG CTATGGCTAAAATC CAAG CAAAAATCC CTGGATTGAAG CGCAAAG 2 022 

Qy 3737 CAGA 3740 

Db 2 023 CTGA 2 02 6 



RESULT 15 
AAI98079 

ID AAI98079 standard; cDNA; 1980 BP. 
XX 

AC AAI98079; 
XX 

DT 04-DEC-2001 (first entry) 
XX 

DE Human neuroblastoma expressed polynucleotide SEQ ID NO 22. 
XX 

KW Human; neuroblastoma; ss . 



XX 

OS Homo sapiens . 
XX 

PN WO200166733-A1 . 
XX 

PD 13-SEP-2001. 
XX 

PF 02-MAR-2001; 2 001WO- JP0163 1 . 
XX 

PR 07-MAR-2000; 2000JP-0159195 . 

PR 12-MAY-2000; 2000JP-0140387 . 
XX 

PA (CHIB-) CHIBA PREFECTURE. 

PA (HISM ) HISAMITSU PHARM CO LTD . 

XX 

PI Nakagawara A; 
XX 

DR WPI; 2001-602630/68. 
XX 

PT Nucleic acids for prognosis of human neuroblastoma comprise nucleic 

PT acids expressed by human neuroblastomas - 

XX 

PS Claim 1; Page 69-70; 159pp; Japanese. 
XX 

CC The invention relates to nucleic acids (AAI98058-AAI98161) or their 

CC homologues expressed by human neuroblastomas useful for detecting genes 

CC expressed by neuroblastoma and for analysing their structure and 

CC function. The nucleic acids are useful for the diagnosis and prognosis of 

CC neuroblastoma. 

XX 

SQ Sequence 1980 BP; 601 A; 373 C; 423 G; 583 T; 0 other; 

Query Match 29.1%; Score 1088.8; DB 22; Length 1980; 
Best Local Similarity 83.5%; Pred. No. 3.7e-221; 

Matches 1289; Conservative 0; Mismatches 237; Indels 18; Gaps 4; 

Qy 2215 AAAGAG CCTGAAAGTTTTAATG CAG CTGTT CAGGAAACAGAAGCT CCTTATATAT CCATT 2274 



Db 




Qy 




Db 



Qy 



2335 TATTCAGAAATAGCAAAATTCGAGAAGTCGGTGCCCGAACACGCTGAGCTAGTGGAGGAT 2394 





Db 



148 TATTCAGAAATGG CAAAAGTTGAACAG C CAGTGCCTGATCATTCTGAGCTAGTTGAAGAT 2 



Qy 




Db 



Qy 




Db 



Qy 



2 5 09 GAGACAGTAGCCCAGCACAAAGAGGAGAGACTTAGTGCCTCACCTCAGGAGCTAGGAAAG 2568 



I III I I II 1 1 1 1 I Ml Mill I II I III MUM, 

Db 328 TCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTTGCCACCTGAGGGAGGAAAG 387 
Qy 2 569 CCATATTTAGAGTCTTTTCAGCCCAATTTACATAGTACAAAAGATGC TGCATCTAAT 2625 

MM II MUM Ml II MM Ml IIIMIMI I I II II 

Db 388 CCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAAAGATACCCTGTTACCTGAT 447 

Qy 2626 GACATTCCAACATTGACCAAAAAGGAGAAAATTTCTTTGCAAATGGAAGAGTTTAATACT 2 685 

II II IIIMIMI Ml MMIMM MM M Mill Ml I I MM 

Db 448 GAAGTTTCAACATTGAG CAAAAAGGAGAAAATTC CTTTG CAGATGGAGGAG CTCAGTACT 507 

Qy 2686 GCAATTTATTCAAATGATGACTTACTTTCTTCTAAGGAAGACAAAATAAAAGAAAGTGAA 2745 

III MIMMMMMMIMM II IMIMIMM I MM Mill MM 

Db 5 08 GCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATAAGAGAAACTGAA 567 

Qy 2746 ACATTTTCAGATTCATCTCCGATTGAGATAATAGATGAATTTCCCACGTTTGTCAGTGCT 28 05 

II Ml 1 1 III IIIMIMI Mill II Mill III M If II M Mill II 

Db 568 ACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACATTGATCAGTCCT 627 
Qy 2806 AAAGATGATTC TCCTAAATTAGCCAAGGAGTACACTGATCTAGAAGTATCCGACAAA 2862 

III MUM I IMMMIMM III II Mill NUN Nil Mill 

Db 628 AAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAAGTATCCCACAAA 687 

Qy 28 63 AGTGAAATTGCTAATATCCAAAGCGGGGCAGATTCATTGCCTTGCTTAGAATTGCCCTGT 2 922 

MIMMMMIMI II II II I MMIMM IMIIIMM I 

Db 688 AGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACAGAATTGCCCCAT 74 7 
Qy 2 923 GACCTTTCTTTCAAGAATATATATCCTAAAGATGAAG TACATGTTT CAGATGAA 2 976 

IIMMIMM Mill Ml I II MM Mill I I I Mill 

Db 74 8 GACCTTTCTTTGAAGAACA.TACAACCCAAAGTTGAAGAGAAAATCAGTTTCTCAGATGAC 807 

Qy 2 977 TTCTCCGAAAATAGGTCCAGTGTATCTAAGGCATCCATATCGCCTTCAAATGTCTCTGCT 3 036 

1 1 II Mill MM I Ml MM I III MM II MM Ml 

Db 8 08 TTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCAGATGTTTCTGCT 8 67 

Qy 3 037 TTGGAACCTCAGACAGAAATGGGCAGCATAGTTAAATCCAAATCACTTACGAAAGAAGCA 3 096 

MM Mill MM II I IMMMIMM Mill III MMIMM 

Db 8 68 TTGG CCACTCAGG CAGAGATAGAGAGCATAGTTAAAC CCAAAGTT CTTGTGAAAGAAG CT 927 

Qy 3 097 GAGAAAAAACTTCCTTCTGACACAGAGAAAGAGGACAGATCCCTGTCAGCTGTATTGTCA 3156 

1 1 1 1 1 : 1 1 1 1 II Mill Ml M Mill I II Ml MM III 

Db 928 GAGAAAAAACTT CCTTCCGATACAGAAAAAGAGGACAGATCACCATCTG CTATATTTT CA 98 7 

Qy 3157 GCAGAG CTGAGTAAAACTT CAGTTGTTGAC CT CCT CTACTGGAGAGACATTAAGAAGACT 3216 

1 1 1 1 1 ! 1 1 1 1 r 1 1 1 1 i 1 1 1 1 1 1 1 1 ; 1 1 1 M 1 1 1 1 i ! I i M I ! 1 1 1 1 i 1 1 

Db 98 8 GCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACT 1047 

Qy 3217 GGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGCATTGTC 32 76 

IMMIIIII lllillllll 1 1 1 1 1 1 llilll II lllllll 1 1 1 1 1 1 1 1 1 1 1 

Db 1048 GGAGTGGTGTTTGGTGCCAGCCTATTCCAGCTGCTTTCATTGACAGTATTCAGCATTGTG 1107 

Qy 3277 AGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGGATATAT 333 6 

II Mill 1 1 1 1 1 1 1 1 1 : 1 I ! 1 1 1 1 1 1 1 1 1 1 Mill 1 1 1 1 1 1 1 1 M 1 1 1 1 1 f I 

Db 1108 AGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATATAC 1167 

Qy 3337 AAGGGCGTGATCCAGGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGGCATATTTA 33 96 

Mill 1 1 1 1 1 1 1 1 I i ! I 1 1 1 1 i 1 1 1 1 1 ! 1 1 1 1 II 1 1 II I I III II! 1 1 1 ! I 



Db 1168 AAGGGTGTGATCC^GCTATCC^GAAATC^GATGAAGGCCACCC^TTCAGGGCATATCTG 1227 

Qy 33 97 GAATCTGAAGTTGCTATATCAGAGGAATTGGTTCAGAAATACAGTAATTCTGCTCTTGGT 3456 

MINI MUM Ml Mill Ml MUM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 

Db 1228 GAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTTGGT 1287 

Qy 3457 CATGTGAACAGCACAATAAAAGAACTGAGGCGGCTTTTCTTAGTTGATGATTTAGTTGAT 3516 

Mill 1 1 1 1 Mill Mill Mill 1 1 1 1 1 S I 1 1 1 1 ! I 1 1 1 1 1 1 i 1 1 1 1 

Db 1288 CATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGAT 1347 

Qy 3517 TCCCTGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTCAATGGT 3576 

II III MUM II Ml II MM Mill MM llllll Ml! Ill 

Db 134 8 TCTCTGGAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGT 14 07 

Qy 3577 CTGACACTACTGATTTTAGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTATGAACGG 3636 

MIIMIMIMIMM Mill II MMIMIMM 

Db 14 08 CTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGG 14 67 

Qy 3637 CAT CAGGTGCAGATAGATCATTATCTAGGACTTG CAAACAAGAGTGTTAAGGATG CCATG 3 696 

IMIMI MM II IM I! MIlMilllll MM llllll Mill III 

Db 14 68 CATCAGG CACAGATAGAT CATTATCTAGGACTTG CAAATAAGAATGTTAAAGATG CTATG 1527 

Qy 36 97 GCCAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCAGA 374 0 

II M! llllll II IMM! IIIMIIIIIIMM II 

Db 152 8 GCTAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCTGA 1571 
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ALIGNMENTS 



RESULT 1 

US-09-484-970B-106 

; Sequence 106, Application US/09484970B 

; Patent No. 6426186 

; GENERAL INFORMATION: 

; APPLICANT: Jones , Karen A. 

; APPLICANT: Volkmuth, Wayne 

; APPLICANT: Walker, Michael G . 

; TITLE OF INVENTION: BONE REMODELING GENES 

; FILE REFERENCE: PB-0014 US 

CURRENT APPLICATION NUMBER: US/0 9/484 , 970B 
CURRENT FILING DATE: 2000-01-18 

; NUMBER OF SEQ ID NOS : 172 
SOFTWARE: PERL Program 

; SEQ ID NO 106 
LENGTH: 4 822 
TYPE: DNA 

ORGANISM: Homo sapiens 



FEATURE : 

NAME /KEY : misc_f eature 

OTHER INFORMATION: Incyte ID No. 6426186 444857. 15CB1 
NAME/ KEY : unsure 

LOCATION : 33, 51, 79, 211, 369, 483-484, 731, 748, 4803, 4805-4806, 4808- 
4809, 

OTHER INFORMATION: a, t, c, g, or other 
US-09-484-970B-106 

Query Match 62.1%; Score 2323.8; DB 4; Length 4822; 

Best Local Similarity 80.9%; Pred. No. 0; 

Matches 3060; Conservative 0; Mismatches 587; Indels 137; Gaps 25; 
Qy 63 CGCGAAGGCAGCAGAAGCAGTCTCATTGTTCCGGGAGCCGTCGCCTCTGCAGGTTCTTCG 122 

I II MINI II IIIIIIIIIIIIMIIIIIMIIIMI III I 1 1 1 1 I III 

Db 78 CNCGGAGGCAGGAGGAGCAGTCTCATTGTTCCGGGAGCCGTCACCACAGTAGGTCCCTCG 137 

Qy 123 GCTCGGCTCGGCACGACTCGGCCTGCCTGGCCCCTGCCAGTCTTGCCCAACCCCCACAAC 182 

1 1 1 1 I I III MM I Mill I MUM hill 

Db 138 GCTCAGT CGGCCCAGCCCCTCTCAGTCCTCCCCAACCCCCACAAC 182 

Qy 183 CGCCCGCGACTCTGAGGAGAAGCGGC - CCTGCGGCGGCTGTAGCTGCAGCATCGTCGGCG 241 

Mill II II I II I 1 1 , 1 i 1 1 1 I IIIIIIIMIII II I 

Db 183 CGCCCGCGGCTCTGAGACGCGGCCCCGGNGGCGGCGGCAGCAGCTGCAGCATCATC - TCC 241 

Qy 242 ACCCGCCAGCCATGGAAGACATAGACCAGTCGTCGCTGGTCTCCTCGTCCACGGACAGCC 301 

MM MIMIMMIMM I Mill I Mill M Ml MIMIIII 

Db 242 ACCCTCCAGCCATGGAAGACCTGGACCAGTCTCCTCTGGT CTCGTCCTCGGACAGCC 298 

Qy 3 02 CGCCCCGGCCTCCGCCCGCCTTCAAGTACCAGTTCGTGACGGAGCCCGAGGACGAGGAGG 361 

I Illlllll I llllll IIIIMIIIIIIIMIIII 

Db 299 CACCCCGGCCGCAGCCCGCGTTCAAGTACCAGTTCGTGAGGGAGCCCGAGGACGAGGAG- 357 

Qy 362 ACGAGGAGGAGGAGGAGGACGAGGAGGAGGACGACGAGGACCTAGAGGAACTGGAGGTGC 421 

M II Mill II II IIIIIMI II Mill Mill Mill Mill Ml 

Db 3 5 8 - - GAAGAAGAGGANGATGAAGAGGAGGACGAGGACGAAGACCTGGAGGAG CTGGAGGTG C 415 
Qy 422 TGGAGAGGAAGCCCGCAGCCGGGCTGTCCGCAGCTGCGGTGC CGCCCGCCGCCG 475 

MMMMMMIMI MMMMMIMI II Mill I II lllllll 

Db 416 TGGAGAGGAAGCCCGCCGCCGGGCTGTCCGCGGCCCCAGTGCCCACCGCCCCTGCCGCCG 475 

Qy 4 76 CCGCGCCGCTGCTGGACTTCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGC 535 

MUM I MM I I I MM Mill II IIIIIMI II II MM 

Db 476 GCGCGCCNNTAATGGACTTCGGAAATGACTTCGTGCCGCCGGCGCCCCGGGGACCCCTGC 53 5 
Qy 53 6 CGGCCGCGCCCCCTGCCGCTCCTGAGAGGCAGCCATCCTGGGAACGCAGCCCCG 58 9 

lllllll Mill I Ml II III MM M Mill I Mill I 

Db 53 6 CGGCCGCTCCCCCCGTCGCCCCGGAGCGGCAGCCGTCTTGGGACCCGAGCCCGGTGTCGT 595 
Qy 5 90 CGGCGCCCGCGCCATCCCTGCCGCCCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAG 64 6 

I I MIMIIII II I I MIMMIMM IIIIIIIIIIMIII I 

Db 596 CGACCGTGCCCGCGCCATCCCCGCTGTCTGCTGCCGCAGTCTCGCCCTCCAAGCTCCCTG 655 

Qy 647 AGGACGACGAGCCTCCGGCGAGGCCCCCGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGG 706 

MIIMIIIIIIIIIIIII MM M Mill M II III III IIIIIMI II 

Db 656 AGGACGACGAGCCTCCGGCCCGGCCTCCCCCTCCTCCCCCGGCCAGCGTGAGCCCCCAGG 715 



Qy 707 CGGAGC CCGCCGCGCCCCCTTCCACGCCGGCCG 73 9 

I II I I lllllllllllll Mill lllllll 

Db 716 CAGAGCCCGTGTGGANCCCGCCAGCCCCGGCTNCCGCCGCGCCCCCCTCCACCCCGGCCG 775 

Qy 74 0 CGCCCAAGCGCAGGGGCTCC- - -GGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTG 796 

IMIIIIIIIIIMIIIIII NIMMIMI MMM M llllll MUM 

Db 776 CGCCCAAGCGCAGGGGCTCCTCGGGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTG 835 

Qy 7 97 CATCTGAGCCTGTGATACCCTCCTCTGCAGAAAAAATTATGGATTTGATGGAGCAGCCAG 856 

IIIIIIIIIIIMIIIII lllllllllllllll llllll 1 1 1 1 lllllllllll 

Db 83 6 CATCTGAGCCTGTGATACGCTCCTCTGCAGAAAA-- -TATGGACTTGAAGGAGCAGCCAG 892 

Qy 8 57 GTAACACTGTTTCGTCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCCT 916 

IIMIIII Mill Mi llllll MINIMI MINI li llllll IIIM I 

Db 893 GTAACACTATTTCGGCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCTT 952 

Qy 917 CTCTTCCTTCTCTATCTCCTCTCTCAACTGTTTCTTTTAAAGAACATGGATACCTTGGTA 976 

lllllllllllll llllllllllll I I llllll Illillllll lllllllllll 

Db 953 CTCTTCCTTCTCTGTCTCCTCTCTCAGCCGCTTCTTTCIAAAGAACATGAATACCTTGGTA 1012 

Qy 977 ACTTATC^GCAGTGTC^TCCTCAGAAGGAACAATTGAAGAAACTTTAAATGAAGCTTCTA 103 6 

I II III MM MM I E 1 1 1 1 1 1 1 II llllll I I I MIMIMIM 

Db 1013 ATTTGTCAACAGTATTACCCACTGAAGGAACACTTCA^ 1072 

Qy 1037 AAGAGTTGCCAGAGAGGGCAACAAATCCATTTGTAAATAGAGATTTAGCAGAATTTTCAG 1096 

Mill I llllll Mill llllll II IMIIIIMM MM lllllll 

Db 1073 AAGAGGTCT(ZAGAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAG 1132 

Qy 10 97 AATTAGAATATTC^GAAATGGGATCATCTTTTAAAGGCTCCCCAAAAGGAGAGTCAGCCA 1156 

M I M 1 1 It I IMIMMIIIIMM! II I I III lllllll III II III 

Db 1133 AATTAGAATACTCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCG 1192 
Qy 1157 TATTAGTAGAAAACACTAAGGAAGAAGTAATTGTGAGGAGTAAA GACAAAGAGGATT 1213 

II llllll II! Ill lllllll MM MM I MM II Mill I I 

Db 1193 TAATAGTAGCAAATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGT 1252 
Qy 1214 TAGTTTGTAGTGCAGCCCTTCACAGTCCACAAGAATCACCT GTGG 1258 

MM! Ill I llllll I II llllll I 1 1 1 1 Ml 

Db 12 53 TAGTTAGTAATAACATCCTTGA.TAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGG 1312 

Qy 12 59 GTAAAGAAGACAGAGTTGTGTCTCCAGAAAAGACAATGGACATTTTTAATGAAATGCAGA 1318 

MUM II MMIIIMI lllllll III MM lllllllllll I 

Db 1313 TTAAAGAGGATGAAGTTGTGTCTTCAGAAAAAGOWU^GA^GTTTTAATGAAAAGAGAG 1372 

Qy 1319 TGT(^GTAGTAGC7\CCTGTGAGGGAAGAGTATGCAGACTTTAAGCC^TTTGAA{^G(^lT 1378 

I MM I III III lllllll II lllllllllll II MIIIMI I II II 

Db 1373 TTGCAGTGGAAGCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTAT 14 32 

Qy 1379 GGGAAGTGAAAGATACTTATGAGGGAAGTAGGGATGTGCTGGCTGCTAGAGCT 1431 

II I II I I I I II I I II I III I III Ml II IIMIIII III I 

Db 1433 GGGAAGTGAAAGATA GTAAGGAAGATAGTGATATGTTGG CTG CTGGAGGTAAAATCG 14 8 9 

Qy 1432 AATGTGGAAAGTAAAGTGGACAGAAAATG CTTGGAAGATAG CCTGGAG CAAAAAA 14 8 6 

M MINIMUM I llllll II I lllllllll lllllll I 

Db 14 90 AGAGCAACTTGGAAAGTAAAGTGGATAAAAAATGTTTTGCAGATAGCCTTGAGCAAACTA 154 9 



Qy 



14 87 GTCTTGGGAAGGATAGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAAC 154 6 



II I II I'M I II Mill III I II MMM 

Db 155 0 AT CACGAAAAAGATAGTGAGAGTAGTAATGATGATACTTCTTT CCCCAGTACGC CAGAAG 160 9 
Qy 154 7 CTGTGAAGGACAGCTCCAGAGCATATATTACCTGTGCTTCCTTTA CCTCAGCAACCG 1603 

I I 1 1 1! I I II IMIIIIMI II MINI 1 1 1 1 1 1 I MMi I 

Db 1610 GTATAAAGGATCGTTCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTG 1669 

Qy 1604 AAAGCACCACAGCAAACACTTTCCCTTTGTT^ 1663 

I 1 1 II II MMM III IMIIMIM Mill MMM Mill! II I 

Db 167 0 AGAGCATTGCAACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCG 172 9 
Qy 1664 ATG-AAAAAAAAATAGAAGAAAGGAAGGCCCAAATTATAACAGAGAAG ACTAGCCCC 1719 

III IIIIIIIIIIMIIIIII Ml M MM IIIIMIIIII MMM M 

Db 173 0 ATGAAAAAAAAAATAGAAGAAAAGAAGGCCCAAATAGTAACAGAGAAGAATACTAGCACC 178 9 

Qy 172 0 AAAACGTCAAATCC - TTTCCTTGTAGCAGTACAGGATTCTGAGGCAGATTATGTTACAAC 1778 

Mill Mill II III IMIIMIM MMI MMi MMMMM Mill 

Db 1790 AAAACATCAAACCCTTTTACTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAAC 184 9 

Qy 177 9 AGATACCTTATCAAAGGTGACTGAGGCAGCAGTGTCAAACATGCCTGAAGGTCTGACGCC 183 8 

MMI III MMMMMMMI II III M 1 1 1 1 1 1 1 1 1 II 1 1 1 Mill II 

Db 185 0 AGATAATTTAACAAAGGTGACTGAGGAAGT CGTGG CAAACATG CCTGAAGG CCTGACTC C 1909 

Qy 183 9 AGATTTAGTTCAGGAAG CATGTGAAAGTGAACTGAATGAAG CCACAGGTACAAAGATTG C 1898 

1 1 1 1 i 1 1 1 1 1 1 1 i 1 1 i 1 1 1 1 1 1 1 1 i i 1 1 i I MMIMII II MIMMIIIIIII 

Db 1910 AGATTTAGTACAGGAAG CATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTG C 196 9 

Qy 18 99 TTATGAAAC7VAAAGTGGACTTGGTCCAAACATCAGAAGCTATACAAGAATCACTTTACCC 1958 

MMM II I Ml I MMMMM MM MMM III III Mill Mill II II 

Db 1970 TTATGAAACAAAAATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCC 2 02 9 

Qy 1959 CACAGCACAGCTTTGCCCATCATTTGAGGAAGCTGAAGCTiACTCCGTCACCAGTTTTGCC 2 018 

Ml! MMM IIMMIM II M I Mill MMI IMIIMIM 

Db 2 03 0 TGCAGCACAGCTTTGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCC 2 08 9 

Qy 2 019 TGATATTGTTATGGAAGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGT 2 078 

III 1 1 1 1 i 1 1 : 1 1 1 1 1 i ' 1 1 MMM MM II MMMMM M I 

Db 2 090 TGACATTGTTATGGAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGAT 214 9 

Qy 2 07 9 GCAGCCCAGTGTATCCCCACTGGAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCT 2138 

MMMM Ml III I MM II II MMM MMM II Mill I 

Db 2150 ACAGCCCAGCTCATCACCATTAGAAG CTTCTTCAGTTAATTATGAAAGCATAAAACA 2206 

Qy 213 9 TGAG CCTGAAAACC CC C CACCATATGAAGAAGCCATGAATGTAG CACT AAAAGCTTT 2195 

1 1 1 1 1 M 1 1 i I M 1 1 1 1! 1 1 1 1 1 1 1 1 1 1 M Mill MM MM Mill I 

Db 22 07 TGAGCCTGAAAACCCCCCACCATATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATC 2266 

Qy 2196 GGGAACAAAGGAAGGAATAAAAGAGCCTGAAAGTTTTAATGCAGCTGTTCAGGAAACAGA 2255 

MM MMMM Ml MMMM! : I MMIMII MM MMMM 

Db 22 67 AGGAATAAAGGAAGAAATTAAAGAG C CTGAAAATATTAATG CAG CT CTTCAAGAAACAGA 2326 

Qy 2256 AGCTCCTTATATATCCATTGCGTGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGCC 2315 

MMMMMIMM Mill MM MMM II MMM M II MM II 

Db 2327 AGCTCCTTATATATCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACC 2386 

Qy 2316 AAGTCCAGATTTCTCTAATTATTGAGAAATAGCAAAATTCGAGAAGTCGGTGCCCGAACA 2375 

I III IIMMIM II MMMMM MMM III III Mill II II 



Db 2387 AGCTCCGGATTTCTCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCA 244 6 

Qy 2376 CGCTGAGCTAGTGGAGGATTCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATGA 2435 

IIIIIIMII II llllllllllllll MMIMMIMMMMIMIMIMM 

Db 244 7 TTCTGAGCTAGTTGAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGA 2 506 

Qy 2436 TTCGATTCCTGAAGTCCCACAAACACAAGAGGAGGCTGTGATGCTCATGAAGGAGAGTCT 24 95 

III II MM II MMM MMM II IMMIIMI MM II Mill 

Db 2507 TTCAATACCTGACGTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCT 2 566 

Qy 24 96 CACTGA AGTGTCTGAGACAGTAGCCCAGCACAAAGAGGAGAGACTTAGTGCCTC 254 9 

MINI II I III I I II 1 1 1 1 I III Mill I 

Db 2567 CACTGAGACTTCATTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTT 2 62 6 

Qy 2550 ACCTCAGGAGCTAGGAAAGCCATATTTAGAGTCTTTTCAGCCCAATTTACATAGTACAAA 2 609 

II I III lllllllllllllll II llllll III II 1 1 1 1 III Mill 

Db 2627 GCCACCTGAGGGAGGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAA 268 6 

Qy 2610 AGATGC TGCATCTAATGACATTCCAACATTGACCAAAAAGGAGAAAATTTCTTTGCA 2666 

MM I I II MM II Ml II I II I Mill III 1 1 MMI I MMI 

Db 2687 AGATACCCTGTTACCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCA 274 6 

Qy 2 667 AATGGAAGAGTTTAATACTGCAATTTATTCAAATGATGACTTACTTTCTTCTAAGGAAGA 2726 

Mill III I I 1 1 1 1 1 M I h I! 1 1 1 1 1 1 1 1 M ! 1 1 II i 1 1 1 1 1 1 II 1 1 

Db 274 7 GATGGAGGAGCT CAGTACTG CAGTTTATT CAAATGATGACTTATTTATTTCTAAGGAAG C 2806 

Qy 272 7 CAAAATAAAAGAAAGTGAAACATTTTCAGATTCATCTCCGATTGAGATAATAGATGAATT 2 78 6 

I MM Mill llllll MMMMMMMMI Mill II llllllll II 

Db 28 07 ACAGATAAGAGAAACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTT 28 66 

Qy 2787 TCCCACGTTTGTCAGTGCTAAAGATGATTC - - - TCCTAAATTAGCCAAGGAGTA CACTGA 2 843 

MMM IMMIIMI MMM I 1 1 1 II 1 1 II II I 1 1 1 1 1 1 1 1 1 1 

Db 2867 CCCTACATTGATCAGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGA 2926 

Qy 2844 TCTAGAAGTATCCGACAAAAGTGAAATTGCTAATATCCAAAGCGGGGCAGATTCATTGCC 2 903 

i llllll N MMI llllll Mill II M II I MMM 

Db 2 927 CCTAGAAGTATCCCACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCC 2 98 6 

Qy 2 904 TTG CTTAGAATTG CCCTGTGACCTTT CTTT CAAGAATATATAT CCTAAAGATGAAG 2 959 

MM llllll MM II II II I MM I Mill III I II MM Mill 

Db 2 987 TTGCACAGAATTGCCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAA 3 04 6 

Qy 2 9 6 0 - - TACATGTTTCAGATGAATTCTCCGAAAATAGGTCCAGTGTATCTAAGGCATCCATATC 3 017 

I I I llllllll II II Mill MM I Ml MM I III 

Db 3 04 7 AATCAGTTTCTCAGATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATT 3106 

Qy 3 018 GCCTTCAAATGTCTCTGCTTTGGAACCTC^GAt^GAAATGGGCAGCATAGTTAAATCCAA 307 7 

MM II MM MMIIIMI MM MM II I 1 1 1 1 II 1 1 1 II I Mil 

Db 3107 GCCTCCAGATGTTTCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAA 3166 

Qy 3 078 ATCACTTACGAAAGAAGCAGAGAAAAAACTTCCTTCTGACACAGAGAAAGAGGACAGATC 313 7 

I Ml MMM IIIIMIIIIIMIIII II Mill IMMIIMI M 

Db 3167 AGTTCTTGTGAAAGAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATC 322 6 

Qy 3138 CCTGTCAGCTGTATTGTCAGCAGAGCTGAG - TAAAACTTCAGTTGTTGACCTCCTCTACT 3196 

I II III MM 1 1 1 1 1 1 1 . ! 1 1 1 1 1 1 1 1 i 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM 

Db 3227 ACCATCTGCTATATTTTCAGCAGAGCTGAGCTAAAACTTCAGTTGTTGACCTCCTGTACT 328 6 



Qy 3197 GGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTC 3256 

1 1 1 1 i 1 1 ! 1 1 ! I i i 1 1 1 1 : 1 1 1 1 M 1 1 1 1 1 ! I M 1 1 1 1 j 1 1 II 1 1 1 II 1 1 1 Ml II 

Db 3287 GGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCAT 334 6 

Qy 32 57 TGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGA 3316 

1 1 II Ml II Ml 1 1! I II II Mill I IN MM I ! MINIM Ml MM 

Db 3347 TGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGA 34 06 

Qy 3317 CTAT CAG CTTTAGGATATATAAGGGCGTGATCCAGG CTATC CAGAAATCAGATGAAGG CC 3376 

I II! MUM II I Mill Mill! MM MIMIIII MMM M 

Db 34 07 CCATCAGCTTTAGGATATACAAGGGTGTGAT CCAAG CTAT C CAGAAAT CAGATGAAGGCC 34 66 

Qy 3377 ACCCATTCAGGGCATATTTAGAATCTGAAGTTGCTATATCAGAGGAATTGGTTCAGAAAT 3436 

MMM llllll I I M MMM MIMIII Mill MMM I I 

Db 34 67 ACCCATTCAGGGCATATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGT 3526 

Qy 3437 ACAGTAATTCTGCTCTTGGTCATGTGAACAGCACAATAAAAGAACTGAGGCGGCTTTTCT 34 96 

1 : 1 : 1 : 1 1 1 1 II 1 1 . [ 1 1 1 1 1 : : I ! I MM Mill MM Mill II MM 

Db 3 52 7 ACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCT 3 58 6 

Qy 34 97 TAGTTGATGATTTAGTTGATTCCCTGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATG 3556 

1 1 1 1 1 1 . : 1 1 1 1 i 1 1 1 1 : 1 1 1 MIM MIMIMI llllll Mill MM 

Db 3 587 TAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATG 364 6 

Qy 3557 TTGGTGCCTTGTTCAATGGTCTGACACTACTGATTTTAGCTCTGATCTCACTCTTCAGTA 3616 

lllllllllllll II Mill llllll IMIIIIII I Mill II llllllllllll 

Db 3 64 7 TTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTG 37 06 

Qy 3 617 TTCCTGTTATTTATGAACGGCATCAGGTGCAGATAGATCATTATCTAGGACTTGCAAACA 3676 

1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 

Db 3 7 07 TTC CTGTTATTTATGAACGG CATCAGG CACAGATAGAT CATTATCTAGGACTTG CAAATA 3 766 

Qy 3 677 AGAGTGTTAAGGATGCCATGGCCAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAG 373 6 

Ml MMM Mill Mill 

Db 3 767 AGAATGTTAAAGATGCTATGG CTAAAATCCAAG CAAAAATC CCTGGATTGAAGCG CAAAG 382 6 

Qy 3737 CAGA 3740 

I II 

Db 3827 CTGA 3830 



RESULT 2 
US-08-700-607-2 

Sequence 2, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 



STATE : CA 
COUNTRY: U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700 , 607 
FILING DATE: Filed Herewith 
ATTORNEY /AGENT INFORMATION: 
NAME: Billings, Lucy J . 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-84 5-4166 
INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 799 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : s ingl e 
TOPOLOGY: linear 
MOLECULE TYPE: CDNA 
IMMEDIATE SOURCE: 
LIBRARY: 

CLONE: Consensus 
US-08-700-607-2 

Query Match 13.3%; Score 497.4; DB 2; Length 799; 

Best Local Similarity 92.7%; Pred. No. 9.2e-106; 

Matches 522; Conservative 0; Mismatches 41; Indels 0; Gaps 0; 
Qy 3178 GTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 3237 

IIIIIIIIIIIIIIIIIIIIIIMMMIMIIIIIIIIIIIIII 

Db 108 GTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 167 

Qy 3238 TTATTCCTGCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACATTGCC 32 97 

llllll III I i II! I lllllllllll II Mill IMIIIIMIII 

Db 168 CTATTCCTGCTGCTTTCATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCC 22 7 

Qy 32 98 TTGGCCCTGCTCTCGGTGACTATCAGCTTTAGGATATATAAGGGCGTGATCCAGGCTATC 3357 

lllllll MM II Mill lllllllllllllllll Mill 1 1 1 1 1 1 1 i llllll 

Db 228 TTGGCCCTGCTCTCTGTGACC^T(^GCTTTAGGATATA(^GGGTGTGATC(^AGCTATC 287 

Qy 3358 CAGAAATCAGATGAAGG CCAC C CATTCAGGG CATATTTAGAATCTGAAGTTGCTATATCA 3417 

IMIIIIIIIIIMIIIIIIIIIMIIIIIIIIIII I MIMIIMMIMIIIIM 

Db 288 CAGAAATCAGATGAAGGCCACCCATTCAGGGCATATCTGGAATCTGAAGTTGCTATATCT 347 

Qy 3418 GAGGAATTGGTTCAGAAATACAGTAATTCTGCTCTTGGTCATGTGAACAGCACAATAAAA 3477 

Mill lllllllllll IIIMIIIIIIMIIIIIIIIIIIIIIIII MM Mill 

Db 348 GAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAG 4 07 

Qy 34 78 GAACTGAGGCGGCTTTTCTTAGTTGATGATTTAGTTGATTCCCTGAAGTTTGCAGTGTTG 3537 

Mill Mill II 1 1 M M 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 MMMMMMMIMI 

Db 4 08 GAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 4 67 



Qy 3538 ATGTGGGTGTTTACTTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTGATTTTAGCT 3597 

Illlllll Mill lllllllllllllllll I I: I I. t: I II III 

Db 4 68 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCT 527 

Qy 3598 CTGATCTCACTCTTCAGTATTCCTGTTATTTATGAACGGCATCAGGTGCAGATAGATCAT 3657 

II II llllllllllll NNI llllll llllll M III llllllllllll 

Db 528 CTCATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCACAGATAGATCAT 587 

Qy 3658 TATCTAGGACTTGCAAACAAGAGTGTTAAGGATGCCATGGCOWUVTCCAAGCAAAAATC 3717 

IIIIIIMIIMIIMI 1 1 1 1 llllll Mill Mill Ml II MM III 1 1 Ml I 

Db 588 TAT CTAGGACTTG CAAATAAGAATGTTAAAGATG CTATGG CTAAAATC CAAG CAAAAATC 647 

Qy 3718 CCTGGATTGAAGCGCAAAGCAGA 374 0 

II I I M I II II II I I II I II II 

Db 648 CCTGGATTGAAGCGCAAAGCTGA 670 



RESULT 3 

US-09-149-476-254 

Sequence 254, Application US/09149476 
Patent No. 6420526 
GENERAL INFORMATION: 
APPLICANT: Rosen et al . 

TITLE OF INVENTION: 186 Human Secreted proteins 
FILE REFERENCE: PZ002P1 

CURRENT APPLICATION NUMBER: US/09/149,476 
CURRENT FILING DATE: 1998-09-08 
EARLIER APPLICATION NUMBER: PCT/US98/044 93 
EARLIER FILING DATE: 1998-03-06 
EARLIER APPLICATION NUMBER: 60/040,162 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,333 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/038,621 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,626 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,334 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,336 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,163 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/047,600 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,615 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,597 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,502 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,633 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,583 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,617 



EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,618 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,503 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,592 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,581 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,584 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,500 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,587 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,492 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,598 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,613 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,582 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,596 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,612 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER : 60/047,632 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,601 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,580 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,568 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,314 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,569 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,311 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,671 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,674 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,669 
EARLIER FILING DATE : 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,312 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,313 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,672 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,315 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/048,974 
EARLIER FILING DATE: 1997-06-06 



EARLIER APPLICATION NUMBER: 60/056,886 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,877 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,889 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,893 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,630 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,878 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,662 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,872 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,882 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,637 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,903 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,888 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,879 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,880 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,894 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,911 
EARLIER FILING DATE : 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,636 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,874 
EARLIER FILING DATE : 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,910 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,864 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,631 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,845 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,892 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,761 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/047,595 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,599 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,588 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,585 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,586 



EARLIER FILING DATE : 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,590 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,594 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,589 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,593 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,614 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,578 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER : 60/043,576 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/047,501 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,670 
EARLIER FILING DATE : 1997-04-11 
EARLIER APPLICATION NUMBER: 60/056,632 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,664 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,876 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,881 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,909 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,875 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,862 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,887 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,908 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/048,964 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/057,650 
EARLIER FILING DATE: 1997-09-05 
EARLIER APPLICATION NUMBER: 60/056,884 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,669 
EARLIER FILING DATE: 1997-09-05 
EARLIER APPLICATION NUMBER: 60/049,610 
EARLIER FILING DATE: 1997-06-13 
EARLIER APPLICATION NUMBER: 60/061,060 
EARLIER FILING DATE: 1997-10-02 



Query Match 6.1%; Score 228.8; DB 4; Length 1766; 

Best Local Similarity 63.4%; Pred. No. 2.5e-43; 

Matches 350; Conservative 0; Mismatches 202; Indels 0; Gaps 0; 
Qy 3174 TTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGC 3233 

I I II I! II I I Illllllll I lllllllllll I II Mill I 

Db 286 TGCGGTGCACGATCTGATTTTCTGGAGAGATGTGAAGAAGACTGGGTTTGTCTTTGGCAC 34 5 



Qy 3234 CAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACAT 3293 

II I II MINI! 1 1 III III Mill I lllllll I III I 

Db 346 CACGCTGATCATGCTGCTTTCCCTGGCAGCTTTCAGTGTCATCAGTGTGGTTTCTTACCT 4 05 

Qy 32 94 TGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGGATATATAAGGGCGTGATCCAGGC 3353 

I 1 1 1 1 II Mill II II MINIM Mill II III III Mill M 

Db 4 06 CATCCTGGCTCTTCTCTCTGTC^CCATCAGCTTC^GGATCTAC^GTCCGTC^TCC^AGC 465 

Qy 3354 TAT C CAGAAAT CAGATGAAGG C CA C C CATT CAGGG CATATTTAGAAT CTGAAGTTG C TAT 3413 

I I Mill Mill I II MM lllllll II II I 1 1 II II M I 

Db 4 66 TGTACAGAAGTCAGAAGAAGGCCATCCATTCAAAGCCTACCTGGACGTAGACATTACTCT 52 5 

Qy 3414 ATCAGAGGAATTGGTTCAGAAATACAGTAATTCTGCTCTTGGTCATGTGAACAGCACAAT 34 73 

II III I II II MM Ml MM II II I Mill I I 

Db 526 GTCCTCAGAAGCTTTCCATAATTACATGAATGCTGCCATGGTGCACATCAACAGGGCCCT 585 



Qy 3474 AAAAGAACTGAGGCGGCTTTTCTTAGTTGATGATTTAGTTGATTCCCTGAAGTTTGCAGT 3533 

III M II II II I II II III I Mill Ml Mill I II II 

Db 586 GAAACTCATTATTCGTCTCTTTCTGGTAGAAGATCTGGTTGACTCCTTGAAGCTGGCTGT 64 5 

Qy 3534 GTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTGATTTT 3593 

ll'IIIIM II I II MM I I M || || I M M II III I 

Db 64 6 CTTCATGTGGCTGATGACCTATGTTGGTGCTGTTTTTAACGGAATCACCCTTCTAATTCT 705 

Qy 3594 AGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTATGAACGGCATCAGGTGCAGATAGA 3653 

iii i i www iii iii mil i i ii ii 1 1 1 ii 

Db 7 06 TGCTGAACTGCTCATTTTCAGTGTCCCGATTGTCTATGAGAAGTACAAGACCCAGATTGA 765 



Qy 3654 TCATTATCTAGGACTTGCAAACAAGAGTGTTAAGGATGCCATGGCCAAAATCCAAGCAAA 3713 

III III I II I M I III I I II III MM 

Db 766 TCACTATGTTGGCATCGCCCGAGATCA.GACCAAGTCAATTGTTGAAAAGATCCAAGCAAA 825 

Qy 3714 AATCCCTGGATT 3725 

Db 826 ACTCCCTGGAAT 837 



RESULT 4 

US-09-149-476-255 

; Sequence 255, Application US/09149476 

; Patent No. 6420526 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al. 

; TITLE OF INVENTION: 186 Human Secreted proteins 
; FILE REFERENCE: PZ002P1 



CURRENT 


APPLICATION 


NUMBER 




US/09/149,476 


CURRENT 


FILING DATE 


1998- 


09 


-08 


EARLIER 


APPLICATION 


NUMBER 




PCT/US98/04493 


EARLIER 


FILING DATE 


1998- 


03 


-06 


EARLIER 


APPLICATION 


NUMBER 




60/040, 162 


EARLIER 


FILING DATE 


1997- 


03 


-07 


EARLIER 


APPLICATION 


NUMBER 




60/040,333 


EARLIER 


FILING DATE 


1997- 


03 


-07 


EARLIER 


APPLICATION 


NUMBER 




60/038, 621 


EARLIER 


FILING DATE 


1997- 


03 


-07 


EARLIER 


APPLICATION 


NUMBER 




60/040,626 



EARLIER FILING DATE : 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,334 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,336 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,163 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/047,600 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,615 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,597 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,502 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,633 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,583 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,617 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,618 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,503 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER : 60/047,592 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,581 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,584 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,500 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,587 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,492 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,598 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,613 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,582 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,596 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,612 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,632 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,601 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,580 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,568 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,314 
EARLIER FILING DATE: 1997-04-11 



EARLIER APPLICATION NUMBER: 60/043,569 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,311 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,671 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,674 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,669 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,312 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,313 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,672 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,315 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/048,974 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/056,886 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,877 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,889 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,893 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,63 0 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,878 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,662 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,872 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,882 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,637 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,903 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,888 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,879 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,880 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,894 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,911 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,636 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,874 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,910 



EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,864 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,631 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,845 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,892 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,761 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/047,595 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,599 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,588 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,585 
EARLIER FILING DATE : 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,586 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,590 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,594 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,589 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,593 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,614 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,578 
EARLIER FILING DATE: 19 97-04-11 
EARLIER APPLICATION NUMBER: 60/043,576 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/047,501 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,670 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/056,632 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,664 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,876 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,881 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,909 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,875 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,862 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,887 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,908 
EARLIER FILING DATE: 1997-08-22 



EARLIER APPLICATION NUMBER: 60/048,964 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/057,650 
EARLIER FILING DATE: 1997-09-05 
EARLIER APPLICATION NUMBER: 60/056,884 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,669 
EARLIER FILING DATE: 1997-09-05 
EARLIER APPLICATION NUMBER: 60/049,610 
EARLIER FILING DATE: 1997-06-13 
EARLIER APPLICATION NUMBER: 60/061,060 
EARLIER FILING DATE: 1997-10-02 

Query Match 6.1%; Score 228.8; DB 4; Length 2664; 

Best Local Similarity 63.4%; Pred. No. 3.1e-43; 

Matches 350; Conservative 0; Mismatches 202; Indels 0; Gaps 0; 
Qy 3174 TTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGC 323 3 

I I II II II II IIIIIIMI I lllllllllll I II Mill I 

Db 261 TGCGGTGCACGATCTGATTTTCTGGAGAGATGTGAAGAAGACTGGGTTTGTCTTTGGCAC 32 0 

Qy 3234 CAGCTTATTCCTGCTGCTGTCTCTGACAGTGTT(^GCATTGTCAGTGTAACGGCCTACAT 3293 

II I II lllllll II III 111 Mill I MM I III I 

Db 321 CACGCTGATC^TGCTGCTTTCCCTGGCAGCTTTC^GTGTC^tCAGTGTGGTTTCTTACCT 38 0 

Qy 32 94 TGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGGATATATAAGGGCGTGATCCAGGC 33 53 

I 1 1 1 1 II Mill II II MM Mill II Ml III Mill II 

Db 381 CATCCTGGCTCTTCTCTCTGTCAC(^TCAGCTTCAGGATCTACAAGTCCGTCATCCAAGC 44 0 

Qy 3354 TATC(^GAAATC^GATGAAGGCCACCCATT(^GGGCATATTTAGAATCTGAAGTTGCTAT 3413 

I I Mill Mill IIIMIII lllllll II II I II II II II I 

Db 441 TGTACAGAAGTCAGAAGAAGGCC^TCCATTCAAAGCCTACCTGGACGTAGAaVTTACTCT 500 

Qy 3414 ATCAGAGGAATTGGTTCAGAAATACAGTAATTCTGCTCTTGGTCATGTG 3473 

M III I II II MM Ml MM I I II I MIM I I 

Db 501 GTCCTCAGAAGCTTTCCATAATTACATGAATGCTGCCATC 560 

Qy 3474 AAAAGAACTGAGGCGGCTTTTCTTAGTTGATGATTTAGTTGATTCCCTGAAGTTTGCAGT 3533 

Ml I I M II II I M II III I Mill III Mill I II M 

Db 561 GAAACTCATTATTCGTCTCTTTCTGGTAGAAGATCTGGTTGACTCCTTGAAGCTGGCTGT 62 0 

Qy 3 534 GTTGATGTGGGTGTTTACTTATGTTGGTG CCTTGTTCAATGGTCTGACACTACTGATTTT 3593 

II IIIMIII I II lllllllllll lllllll I II II II Ml I 

Db 621 CTTCATGTGGCTGATGACCTATGTTGGTGCTGTTTTTAACGGAATCACCCTTCTAATTCT 680 

Qy 3 594 AGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTATGAACGGCATCAGGTGCAGATAGA 3653 

III I I 1 1 II 1 1 III II I Mill I I II Mill II 

Db 681 TGCTGAACTGCTCATTTTCAGTGTCCCGATTGTCTATGAGAAGTACAAGACCCAGATTGA 74 0 

Qy 3 654 TCATTATCTAGGACTTG CAAACAAGAGTGTTAAGGATG C CATGG CCAAAATCCAAG CAAA 3713 

Ml Ml I II I II I III II II MIMMIMI 

Db 741 TCACTATGTTGGCATCGCCCGAGATCAGACCAAGTCAATTGTTGAA 800 

Qy 3714 AATCCCTGGATT 3 725 

I IIIMIII I 
Db 8 01 ACTCCCTGGAAT 812 



RESULT 5 
US-08-700-607-4 

Sequence 4, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Bandman, Olga 
Au-Young, Janice 
Goli, Surya K. 
Hillman, Jennifer L. 
TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE : CA 
COUNTRY : U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700 , 607 
FILING DATE: Filed Herewith 
ATTORNEY/ AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,74 9 
REFERENCE/ DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 4: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 1095 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
IMMEDIATE SOURCE: 

LIBRARY: THP1NOB01 
CLONE: 3187 0 
US-08-700-607-4 

Query Match 5.4%; Score 203.6; DB 2; Length 1095; 

Best Local Similarity 61.6%; Pred. No. 1.3e-37; 

Matches 337; Conservative 1; Mismatches 208; Indels 1; Gaps 1; 
Qy 3174 TTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGC 3233 

II II II II I I MIIIMM I lllllllllll I II Mill I 

Db 328 TGCGGTGCACGATCTGATTTTMTGGAGAGATGTGAAGAAGACTGGGTTTGTCTTTGGCAC 38 7 

Qy 3234 CAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACAT 32 93 

II I II Ml II II III III Mill I Mill I Ml I 

Db 388 CACGCTGATCATGCTGCTTTCCCTGGCA.GCTTTCAGTGTCATCAGTGTGGTTTCTTACCT 447 



Qy 32 94 TGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGGATATATAAGGGCGTGATCCAGGC 3353 

I 1 1 1 1 II Mill II II MINIM Mill II III III Mill II 

Db 448 CATCCTGGCTCTTCTCTCTGTCACCATCAGCTTCAGGATCTACAAGTCCGTCATCCAAGC 507 

Qy 3354 TATCCAGAAATCAGATGAAGGCCACCCATTCAGGGCATATTTAGAATCTGAAGTTGCTAT 3413 

I I Mill Mill III! 1 1 1 M M II II I II II II II I 

Db 508 TGTACAGAAGTCAGAAGAAGGCCATCCATTCAAAGCCTACCTGGACGTAGACATTACTCT 567 

Qy 3414 ATCAGAGGAATTGGTTCAGAAATACAGTAATTCTGCTCTTGGTCATGTGAACAGCACAAT 3473 

II III I II II 1 1 1 1 III 1 1 1 1 I I II I Mill I I 

Db 568 GTCCTCAGAAGCTTTCCATAATTACATGAATGCTGCCATGGTGCACATCAACAGGGCCCT 627 

Qy 3474 AAAAGAACTGAGGCGGCTTTTCTTAGTTGATGATTTAGTTGATTCCCTGAAGTTTGCAGT 3533 

III I I II II II I II II III I Mill III Mill I II II 

Db 628 GAAACTCATTATTCGTCTCTTTCTGGTAGAAGATCTGGTTGACTCCTTGAAGCTGGCTGT 687 

Qy 3534 GTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTGATTTT 3593 

II llllll II I II lllllllllll I II II II I II II II III I 

Db 688 CTTCATGTGGCTGATGACCTATGTTGGTGCTGTTTTTAACGGAATCACCCTTCTAATTCT 74 7 

Qy 35 94 AGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTATGAACGGCATCAGGTGCAGATAGA 3653 

III I I II Ml I II M I III I II MM I I I 

Db 74 8 TGCTGAACTGCTCATTTTNAGTGTCCCGATTGTNTATNAGAAGTACAAGGTTC - CAAGCA 8 06 

Qy 3654 T CATTATCTAGGACTTG CAAACAAGAGTGTTAAGGATG CCATGG C CAAAATCCAAG CAAA 3713 

II II I I MM M II II I Mill I I I MM 

Db 8 07 AAACTCCCTGGAATCGCCAAAAAAAAGGCAGAATAAGTACATGGAAACCAGAAATGCAAC 866 

Qy 3714 AATCCCT 3720 

Db 8 67 AGTTACT 873 



RESULT 6 

US-09-149-476-102 

; Sequence 102, Application US/09149476 

; Patent No. 6420526 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al . 

; TITLE OF INVENTION: 186 Human Secreted proteins 
; FILE REFERENCE: PZ002P1 

; CURRENT APPLICATION NUMBER: US/09/149 , 476 

; CURRENT FILING DATE: 1998-09-08 

; EARLIER APPLICATION NUMBER: PCT/US98/044 93 

; EARLIER FILING DATE: 1998-03-06 

; EARLIER APPLICATION NUMBER: 60/040,162 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,333 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/038,621 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,626 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,334 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,336 



EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,163 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/047,600 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,615 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,597 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,502 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,633 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,583 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,617 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,618 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,503 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,592 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,581 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,584 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,500 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,587 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,492 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,598 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,613 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/04 7,582 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,596 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,612 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,632 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,601 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER : 60/043,580 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,568 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,314 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,569 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,311 
EARLIER FILING DATE: 1997-04-11 



EARLIER APPLICATION NUMBER: 60/043,671 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,674 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER : 60/043,669 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,312 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,313 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,672 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,315 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/048,974 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/056,886 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,877 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,889 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,8 93 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,630 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,878 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,662 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,872 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,882 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,637 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,903 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,888 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER : 60/056,879 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,880 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,894 
EARLIER FILING DATE : 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,911 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,636 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER : 60/056,874 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,910 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,864 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,631 



EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,845 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,892 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,761 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/047,595 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,599 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/04 7,588 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,585 
EARLIER FILING DATE : 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,586 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,590 
EARLIER FILING DATE : 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,594 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,589 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER : 60/047,593 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,614 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,578 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,576 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/047,501 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,670 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/056,632 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,664 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,876 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,881 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,909 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,875 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,862 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,887 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,908 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/048,964 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/057,650 
EARLIER FILING DATE: 1997-09-05 



EARLIER APPLICATION NUMBER: 60/056,884 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,669 
EARLIER FILING DATE: 1997-09-05 
EARLIER APPLICATION NUMBER: 60/049,610 
EARLIER FILING DATE: 1997-06-13 
EARLIER APPLICATION NUMBER: 60/061,060 
EARLIER FILING DATE: 1997-10-02 

Query Match 4.8%; Score 18 0.4; DB 4; Length 7 94; 

Best Local Similarity 61.0%; Pred. No. 2.6e-32; 

Matches 332; Conservative 6; Mismatches 202; Indels 4; Gaps 3; 
Qy 3174 TTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGC 3233 

I I II MM i I II MM Mil MM III MM I 

Db 253 TGCGGTGCACGATCTGATTTTCTGGAGAGATGTGAAGAAGACTGGGTTTGTCTTTG - - GA 310 

Qy 3234 CAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTGTAACGGCC - TACA 32 92 

II ! II MINI II III III Mill I llhlll I lh 

Db 311 CACGCTGATCATGCTGCTTTCCCTGGCAGCTTTCAGTGTCATCARTGTGGGTTTCTTAMC 3 70 

Qy 32 93 TTGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGGATATATAAGGGCGTGATCCAGG 3352 

I I MM II Mill II II Mill MM II Ml Ml Mill I 

Db 371 TC^TCCTGGCTCTTCTCTCTGTC^CCATCARCTTCAGGATCTACAAGTCCGTCATCC^AG 43 0 

Qy 3353 CTATCCAGAAATCAGATGAAGGCCACCCATT - CAGGGCATATTTAGAATCTGAAGTTGCT 3411 

II I IMIMMIM =111111! MM M II II I II II II II 

Db 431 CTGTWCT^GAARTCAGAARAAGGCCTVTCCAWTCCAAAGCCTACCTGGACGT^ 4 90 

Qy 3412 ATATCAGAGGAATTGGTTCAGAAATACAGTAATTCTGCTCTTGGTCATGTGAACAGCACA 3471 

I II III I M II MM III MM I I II I Mill I 

Db 491 CTGTCCTCAGAAGCTTTC(^TAATTACATGAATGCTGCO\TGGTGCAC^TCAACAGGGCC 550 

Qy 3472 ATAAAAGAACTGAGGCGGCTTTTCTTAGTTGATGATTTAGTTGATTCCCTGAAGTTTGCA 3 531 

I Ml II II II II I II 1 1 III I IMM III Mill I II 

Db 551 CTGAAACTCATTATTCGTCTCTTTCTGGTAGAAGATCTGGTTGACTCCTTGAAGCTGGCT 610 

Qy 3532 GTGTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTGATT 3 5 91 

II II MINI II I II I i 1 1 1 ! 1 1 1 1 1 I II II II I II II II III 

Db 611 GTCTTCATGTGGCTGATGACCTATGTTGGTGCTGTTTTTAACGGAATCACCCTTCTAATT 670 

Qy 3592 TTAGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTATGAACGGCATCAGGTGCAGATA 3 651 

I III I I Mil III III Mill I I II Mill 

Db 671 CTTGCTGAACTGCTCATTTTCAGTGTCCCGATTGTCTATGAGAAGTACAAGACCCAGATT 73 0 

Qy 3652 GATCATTATCTAGGACTTGC^AACAAGAGTGTTAAGG 3 711 

Mill III I II I II I ill II II MM I 

Db 731 GATCACTATGTTGGCAT CG C CCGAGAT CAGAC CAAGTCAATTGTTGAAAAGAT CCCAAG C 7 90 

Qy 3712 AAAA 3715 

Db 791 AAAA 794 



RESULT 7 
US-08-700-607-9 

; Sequence 9, Application US/08700607 



; Patent No. 5858708 

; GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 

APPLICANT: Au- Young, Janice 

APPLICANT: Goli, Surya K. 

APPLICANT : Hillman, Jennifer L ♦ 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 

STREET: 3174 Porter Drive 

CITY: Palo Alto 

STATE : CA 

COUNTRY: U.S. 

ZIP: 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08 /7 00 , 607 

FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 

NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,74 9 

REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-855-0555 

TELEFAX: 415-845-4166 
; INFORMATION FOR SEQ ID NO: 9: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 261 base pairs 

TYPE: nucleic acid 

STRANDEDNESS : s ingl e 

TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
IMMEDIATE SOURCE: 

LIBRARY: SPLNFET01 

CLONE: 2 8 742 
US-08-700-607-9 

Query Match 4.4%; Score 164.6; DB 2; Length 261; 

Best Local Similarity 86.7%; Pred. No. 6.7e-29; 

Matches 176; Conservative 0; Mismatches 27; Indels 0; Gaps 0; 
Qy 3237 CTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACATTGC 32 96 



Db 



1 




Qy 



3297 



CTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGGATATATAAGGGCGTGATCCAGGCTAT 3356 



Db 



61 



CTTNGCCCTGCNCTCTGTGACCATCAGCTNTAGGCTATACAAGGGTGTGATCCAAGCTAT 12 




QY 



3357 



CCAGAAATCAGATGAAGGCCACCCATTCAGGGCATATTTAGAATCTGAAGTTGCTATATC 3416 



Db 



121 




Qy 3417 AGAGGAATTGGTT CAGAAATACA 343 9 

Mill III IMIMI MM 

Db 181 TGAGGAGTTGNTTCAGAAGTACA 2 03 



RESULT 8 

US-08-232-463-14/C 

; Sequence 14, Application US/08232463 

; Patent No. 5670367 

; GENERAL INFORMATION: 

APPLICANT: DORNER , F. 

APPLICANT: SCHEI FLINGER, F. 

APPLICANT: FALKNER , F. G. 

TITLE OF INVENTION: RECOMBINANT FOWLPOX VIRUS 
NUMBER OF SEQUENCES: 52 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Foley & Lardner 

STREET: 1800 Diagonal Road, Suite 500 

CITY: Alexandria 

STATE : VA 

COUNTRY : USA 

ZIP: 22313-0299 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC -DOS /MS -DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08 /232 , 4 63 

FILING DATE: 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA : 

APPLICATION NUMBER: US/07/93 5 , 3 13 

FILING DATE: 

APPLICATION NUMBER: EP 91 114 300.6 

FILING DATE: 26-AUG-1991 
ATTORNEY/AGENT INFORMATION: 

NAME: BENT, Stephen A. 

REGISTRATION NUMBER : 2 9,768 

REFERENCE/DOCKET NUMBER: 30472/114 IMMU 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (703)836-9300 

TELEFAX: (703)683-4109 

TELEX: 8 9914 9 
; INFORMATION FOR SEQ ID NO: 14: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 7218 base pairs 

TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear 
IMMEDIATE SOURCE: 

CLONE: pTZgpt-Fls 
US-08-232-463-14 

Query Match 2.0%; 
Best Local Similarity 5.3%; 



Score 75.4; DB 1; Length 7218; 
Pred. No. 1.8e-07; 



Matches 22; Conservative 242; Mismatches 153; Indels 0; Gaps 0; 

Qy 112 7 TTAAAGG CTCCC CAAAAGGAGAGT CAGCCATATTAGTAGAAAACACTAAGGAAGAAGTAA 1186 

IIIMI | || | |:: : : :: :::::: : :::::::::: :: 

Db 14 5 9 TTAAAGAGATAGAAGAATTTGGTACRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 14 00 

Qy 1187 TTGTGAGGAGTAAAGACAAAGAGGATTTAGTTTGTAGTG CAG C CCTTCACAGT CCACAAG 124 6 



Db 13 99 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 134 0 

Qy 1247 AATCACCTGTGGGTAAAGAAGACAGAGTTGTGTCTCCAGAAAAGACAATGGACATTTTTA 1306 



Db 133 9 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 128 0 

Qy 13 07 ATGAAATG CAGATGT CAGTAGTAG CACCTGTGAGGGAAGAGTATG CAGACTTTAAG CCAT 13 66 



Db 127 9 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 122 0 

Qy 13 67 TTGAACAAGCATGGGAAGTGAAAGATACTTATGAGGGAAGTAGGGATGTGCTGGCTGCTA 14 26 



Db 1219 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 1160 

Qy 1427 GAGCTAATGTGGAAAGTAAAGTGGACAGAAAATGCTTGGAAGATAGCCTGGAGCAAAAAA 14 86 



Db 1159 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 1100 

Qy 1487 GTCTTGGGAAGGATAGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAG 1543 



Db 1099 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRATCGCAAGCTCCCTCGACCTGCAG 1043 



RESULT 9 

US-09-128-155-16 

; Sequence 16, Application US/09128155 
; Patent No. 6117654 
; GENERAL INFORMATION: 
; APPLICANT: Pan, Yang 

; TITLE OF INVENTION: NOVEL MOLECULES OF TANGO-77 RELATED PROTEIN FAMILY 

; TITLE OF INVENTION: AND USES THEREOF 

; FILE REFERENCE: 09404/052001 

; CURRENT APPLICATION NUMBER: US/09/128,155 

; CURRENT FILING DATE: 1998-08-03 

; EARLIER APPLICATION NUMBER: US 60/091,650 

; EARLIER FILING DATE: 1998-07-02 

; EARLIER APPLICATION NUMBER: US 60/054,646 

; EARLIER FILING DATE: 1997-08-04 

; NUMBER OF SEQ ID NOS : 18 

; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 16 

LENGTH: 152331 

TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 

NAME/KEY: misc_feature 
LOCATION: ( 1 )...( 15233 1 ) 
OTHER INFORMATION: n = A,T,C or G 
US-09-128-155-16 



Query Match 2.0%; Score 75.2; DB 3; Length 152331; 

Best Local Similarity 53.8%; Pred. No. le-06; 

Matches 155; Conservative 0; Mismatches 133; Indels 0; Gaps 



0; 



Qy 463 CCGCCCGCCGCCGCCGCGCCGCTGCTGGACTTCAGCAGCGACTCGGTGCCCCCCGCGCCC 522 

MINI III llllll II I I I I I I 

Db 21936 CCCCGCCCCCCCGGCCCGCCCCCCGCGGCCCCCCACCCCCCCCCCCCCCCCCCCGCGCCC 21995 

Qy 523 CGCGGGCCGCTGCCGGCCGCGCCCCCTGCCGCTCCTGAGAGGCAGCCATCCTGGGAACGC 582 

III II I I III I I III I I I I II I I II II I 

Db 21996 CGCCCCCCCCCCCGCGCCCCCCACCCCCCCGCCCCCCCGCCCCCCCCCCCCCCCCCCACC 22 055 

Qy 583 AGCCCCGCGGCGCCCGCGCCATCCCTGCCGCCCGCTGCCGCAGTCCTGCCCTCCAAGCTC 642 

II I I I I II II III I I I I I III I I II III II II 

Db 22056 CCCCACACCCGGCCCACACGCACCCCCCACCCCGACGCCCCCGCCCCCCCCCCCCCGCAG 22115 



Qy 64 3 CCAGAGGACGACGAGCCTCCGGCGAGGCCCCCGCCTCCGCCGCCAGCCGGCGCGAGCCCC 7 02 

II III I III 1 I III I II III I II III III III 

Db 22116 CCGACGCCCCCCCCCCGCCCGCCCCGCCCCGCACCCCCGACCCCCCCCGCCGCCCCGCCC 22175 

Qy 703 CTGGCGGAGCCCGCCGCGCCCCCTTCCACGCCGGCCGCGCCCAAGCGC 750 

III III Ml Mill II I II MMM I I I 

Db 22176 CCGCCCCCCCCCCCCGGCCCCCCCCCCGCCGGCGCGGCGCCCCACCCC 22223 



RESULT 10 

US-09-894-998A-35/C 

; Sequence 35, Application US/09894998A 

; Patent No. 6537555 

; GENERAL INFORMATION: 

; APPLICANT: Hosken, Nancy Ann 

; APPLICANT: Craig H. Day 

; APPLICANT: Davin C. Dillon 

; APPLICANT: McGowan, Patrick 

; APPLICANT: Sleath, Paul R. 

; TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE DIAGNOSIS AND 
; TITLE OF INVENTION: TREATMENT OF HERPES SIMPLEX VIRUS INFECTION 
; FILE REFERENCE: 210121.538 

; CURRENT APPLICATION NUMBER: US/09/8 94 , 9 98A 
; CURRENT FILING DATE: 2001-06-28 
; NUMBER OF SEQ ID NOS : 64 

SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 35 

LENGTH: 24 81 

TYPE: DNA 

ORGANISM: HSV-2 
US-09-894-998A-35 



Query Match 2.0%; Score 74.6; DB 4; Length 2481; 

Best Local Similarity 48.7%; Pred. No. 1.6e-07; 

Matches 2 03; Conservative 0; Mismatches 214; Indels 0; Gaps 0; 



Qy 327 GTACCAGTTCGTGACGGAGCCCGAGGACGAGGAGGACGAGGAGGAGGAGGAGGACGAGGA 38 6 

llllll MM II I I I I Mill MM llllll Mill 

Db 192 0 GGACGCGGACGCGACGCTCCCACCAGCCCCGCCCGCAGAGGAAGAGGCGGAGGAGGAGGA 18 61 



Qy 3 87 GGAGGACGACGAGGAC CTAGAGGAACTGGAGGTG CTGGAGAGGAAG C CCGCAGCCGGGCT 446 

II III I Mill Mill Mill I MM I III I I II 

Db 1860 GGCGGAGGAGGAGGAGGCGGAGGAGGAGGAGGCGGAGGAGGAGGAGGCGGAGGAGGAGGA 1801 

Qy 44 7 GTCCGCAGCTGCGGTGCCGCCCGCCGCCGCCGCGCCGCTGCTGGACTTCAGCAGCGACTC 506 

I I I I I II I M I I I III II I I II MM 

Db 18 00 GGCGGAGGAGGAGGAGG CGGAGGAGGAGGAGG CGG CGG CGACCG CGG C CTGGGACGACGG 1741 

Qy 507 GGTGCCCCCCGCGCCCCGCGGGCCGCTGCCGGCCGCGCCCCCTGCCGCTCCTGAGAGGCA 566 

I II II I I I III I MM II II I III 

Db 174 0 AGACGCCGACGGGGGCGCGGCGCCCGCGGACGCCGGGGCGAGCGGCCCGTGGCCGCGGTC 1681 

Qy 567 GCCATCCTGGGAACGCAGCCCCGCGGCGCCCGCGCCATCCCTGCCGCCCGCTGCCGCAGT 626 

III III I I II I III I I II I Ml I II I 

Db 168 0 GCCCGAGTCCGAGTCCGGGGCCCGGCGCGGCGCCGCCCTCTTGGCCCCCACCCCCTGGGG 1621 

Qy 627 CCTGCCCTCCAAGCTCCCAGAGGACGACGAGCCTCCGGCGAGGCCCCCGCGTCCGCCGCC 686 

I Ml! I III II II III I II Ml 

Db 1620 GG CGAGGGG CGAG CG CGGGG CGG CGGAGGAAGAGG CGGAGG ACG AGG CCGCGGGG C C CGA 1561 

Qy 687 AGCCGGCGCGAGCCCCCTGGCGGAGCCCGCCGCGCCCCCTTCCACGCCGGCCGCGCC 743 

III I II III II II II Mill MM Ml MM III 

Db 1560 GTCCGACCCGCGCCTCTTCCGGGGGCGGGCCGCCGCCCCCTCCGCGGCGTGGGGGGC 1504 



RESULT 11 

US-09-103-840A-2/C 

; Sequence 2, Application US/09103840A 

; Patent No. 6294328 

; GENERAL INFORMATION: 

; APPLICANT: FLEISCHMAN, Robert D. 

; APPLICANT: WHITE, Owen R. 

; APPLICANT: FRASER, Claire M . 

; APPLICANT: VENTER, John C 

; TITLE OF INVENTION: DNA SEQUENCES FOR STRAIN ANALYSIS IN MYCOBACTERIUM • 
; TITLE OF INVENTION: TUBERCULOSIS 

FILE REFERENCE: 24366-20007.00 
; CURRENT APPLICATION NUMBER: US/09/103 , 840A 
; CURRENT FILING DATE : 1998-06-24 
; NUMBER OF SEQ ID NOS : 2 
; SOFTWARE: Patentln Ver. 2.1 
; SEQ ID NO 2 

LENGTH: 44 03765 

TYPE: DNA 

; ORGANISM: Mycobacterium tuberculosis 
FEATURE : 

OTHER INFORMATION: CDC 1551 

OTHER INFORMATION: "n" bases at various positions throughout the sequence 
OTHER INFORMATION: represent a, t, c or g 
US-09-103-840A-2 

Query Match 2.0%; Score 73.2; DB 3; Length 44 03765; 

Best Local Similarity 52.3%; Pred. No. 1.8e-05; 

Matches 162; Conservative 0; Mismatches 148; Indels 0; Gaps 0; 



Qy 434 CCGCAGCCGGGCTGTCCGCAGCTGCGGTGCCGCCCGCCGCCGCCGCGCCGCTGCTGGACT 4 93 



Db 3926346 CCGTGCCGGCGCTGCCCGCGCCGCCGGCGCCGCCTTGGCCGCCGGTGCCGCCGATACCGG 
3926287 

Qy 4 94 TCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGCCGGCCGCGCCCCCTGCCG 553 

I II II I I II I I I I III Mill || || II Ml 

Db 3926286 CCTTGCGCGCGGCGCCGACAACCCCGCCGGTTCCTCCGGTGCCGGCGGCCCCGCCGGCCC 

3926227 

Qy 554 CTCCTGAGAGGCAGCCATCCTGGGAACGCAGCCCCGCGGCGCCCGCGCCATCCCTGCCGC 613 

I II I I I I I I I I I I II II I II M II I I I 

Db 3926226 CGCCGGCGCCGGCGTTACCGCCAGTCCCACCCGCGCCGCCGTCGGCGCCAATCCCGCTGG 

3926167 

Qy 614 CCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAGAGGACGACGAGCCTCCGGCGAGGCCCC 673 

I I Ml III III II MM III MM I III 

Db 3926166 CATTATCAGCACCGGAGCCACCCATGCCGCCGGCGCCGCCTTGGCCGCCGGTGCCGCCGG 
3926107 

Qy 674 CGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGGCGGAGCCCGCCGCGCCCCCTTCCACGC 733 

I II Ml Ml II I Mill Ml II III M I Ml 

Db 3926106 CACCACCGGAGCCGTTGATGCCGCCGGCAATGGCGTTGCCGCCCTGGCCGCCGACGCCGC 
3926047 

Qy 734 CGGCCGCGCC 743 

Mill MM 

Db 3926046 CGGCCCCGCC 3926037 



RESULT 12 

US-09-103-840A-1/C 

; Sequence 1, Application US/09103840A 

; Patent No. 6294328 

; GENERAL INFORMATION: 

; APPLICANT: FLEISCHMAN, Robert D. 

; APPLICANT: WHITE , Owen R. 

; APPLICANT: FRASER, Claire M. 

; APPLICANT: VENTER , John C. 

; TITLE OF INVENTION: DNA SEQUENCES FOR STRAIN ANALYSIS IN MYCOBACTERIUM 
; TITLE OF INVENTION: TUBERCULOSIS 

FILE REFERENCE: 24366-2 0007.00 
; CURRENT APPLICATION NUMBER: US/09/103 , 840A 
; CURRENT FILING DATE: 1998-06-24 
; NUMBER OF SEQ ID NOS : 2 
; SOFTWARE: Patent In Ver. 2.1 
; SEQ ID NO 1 

LENGTH: 4411529 

TYPE: DNA 

ORGANISM: Mycobacterium tuberculosis 
OTHER INFORMATION: H37Rv 
US-09-103-840A-1 

Query Match 2.0%; Score 73.2; DB 3; Length 441152 9; 

Best Local Similarity 52.3%; Pred. No. 1.8e-05; 

Matches 162; Conservative 0; Mismatches 148; Indels 0; Gaps 0; 
Qy 434 CCGCAGCCGGGCTGTCCGCAGCTGCGGTGCCGCCCGCCGCCGCCGCGCCGCTGCTGGACT 4 93 

Ml I I MM MM I III 1 1 1 i 1 1 I I I Mill I I 



Db 3932558 CCGTGCCGGCGCTGCCCGCGCCGCCGGCGCCGCCTTGGCCGCCGGTGCCGCCGATACCGG 
3932499 

Qy 4 94 TCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGCCGGCCGCGCCCCCTGCCG 553 

I II II I I II I I I I I I I UNI I II II II III 

Db 3932498 CCTTGCCCGCGGCGCCGACAACCCCGCCGGTTCCTCCGGTGCCGGCGGCCCCGCCGGCCC 

3932439 



Qy 554 CTCCTGAGAGGCAGCCATCCTGGGAACGCAGCCCCGCGGCGCCCGCGCCATCCCTGCCGC 613 

I II II I I I I II M II Ml Ml I II II I 

Db 3932438 CGCCGGCGCCGGCGTTACCGCCAGTCCCACCCGCGCCGCCGTCGGCGCCAATCCCGCTGG 

3932379 

Qy 614 CCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAGAGGACGACGAGCCTCCGGCGAGGCCCC 673 

I I III III III II II I I III I I I I I III 

Db 3932378 CATTATCAGCACCGGAGCCACCCATGCCGCCGGCGCCGCCTTGGCCGCCGGTGCCGCCGG 

3932319 



Qy 674 CGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGGCGGAGCCCGCCGCGCCCCCTTCCACGC 733 

I II II I III II I Mill Ml II Ml II I III 

Db 3932318 CACCACCGGAGCCGTTGATGCCGCCGGCAATGGCGTTGCCGCCCTGGCCGCCGACGCCGC 
3932259 

Qy 734 CGGCCGCGCC 743 

Mill MM 

Db 3932258 CGGCCCCGCC 3932249 



RESULT 13 
US-09-165-264-8/C 

; Sequence 8, Application US/09165264 
; Patent No. 6197510 
; GENERAL INFORMATION: 

APPLICANT: Vinayagamoorthy , Thuraiayah 

TITLE OF INVENTION: Multi-Loci Genomic Analysis 
; FILE REFERENCE: 44 747 

; CURRENT APPLICATION NUMBER: US/ 09/165 , 264 
; CURRENT FILING DATE: 1998-10-01 
; NUMBER OF SEQ ID NOS : 14 

SOFTWARE: Patentln Ver. 2.1 
; SEQ ID NO 8 

LENGTH: 319 

TYPE : DNA 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Description of Artificial Sequence : Primer sequence 
US-09-165-264-8 



Query Match 1.8%; Score 68.6; DB 3; Length 319; 

Best Local Similarity 52.2%; Pred. No. 1.3e-06; 

Matches 152; Conservative 0; Mismatches 13 9; Indels 0; Gaps 0; 
Qy 4 54 GCTGCGGTGCCGCCCGCCGCCGCCGCGCCGCTGCTGGACTTCAGCAGCGACTCGGTGCCC 513 

Mill I II III II II M I II I I I I I I I I III 

Db 3 08 GCTGCAGACCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC 24 9 



Qy 514 CCCGCGCCCCGCGGGCCGCTGCCGGCCGCGCCCCCTGCCGCTCCTGAGAGGCAGCCATCC 573 



Ill I 1 1 1 1 I II ! II II I Mill Mill I II II 

Db 24 8 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC 189 

Qy 574 TGGGAACGCAGCCCCGCGGCGCCCGCGCCATCCCTGCCGCCCGCTGCCGCAGTCCTGCCC 633 

I I MM I I III I II Ml II III I II I II II! 

Db 188 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC 129 

Qy 634 TCCAAGCTCCCAGAGGACGACGAGCCTCCGGCGAGGCCCCCGCCTCCGCCGCCAGCCGGC 693 

II I III II II II I Mill II II II II II I 

Db 128 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC 69 

Qy 694 GCGAGCCCCCTGGCGGAGCCCGCCGCGCCCCCTTCCACGCCGGCCGCGCCC 744 

I Mill I III II I Mill II III Ml Ml 

Db 68 CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC 18 



RESULT 14 
US-09-103-840A-2 

; Sequence 2, Application US/09103840A 

; Patent No. 6294328 

; GENERAL INFORMATION: 

; APPLICANT: FLEISCHMAN, Robert D. 

; APPLICANT: WHITE , Owen R. 

; APPLICANT: FRASER, Claire M. 

; APPLICANT: VENTER, John C. 

; TITLE OF INVENTION: DNA SEQUENCES FOR STRAIN ANALYSIS IN MYCOBACTERIUM 

; TITLE OF INVENTION: TUBERCULOSIS 

; FILE REFERENCE: 24366-20007.00 

; CURRENT APPLICATION NUMBER: US/ 09/ 103 , 84 OA 

; CURRENT FILING DATE: 1998-06-24 

; NUMBER OF SEQ ID NOS : 2 

; SOFTWARE: Patentln Ver. 2.1 

; SEQ ID NO 2 

LENGTH: 4403765 

TYPE: DNA 

ORGANISM: Mycobacterium tuberculosis 
FEATURE : 

OTHER INFORMATION: CDC 1551 
; OTHER INFORMATION: "n" bases at various positions throughout the sequence 

OTHER INFORMATION: represent a, t, c or g 
US-09-103-840A-2 

Query Match 1.8%; Score 68; DB 3; Length 44 03765; 

Best Local Similarity 48.9%; Pred. No. 0.00029; 

Matches 216; Conservative 0; Mismatches 22 0; Indels 6; Gaps 1; 
Qy 3 02 CGCCCCGGCCTCCGCCCGCCTTCAAGTACCAGTTCGTGACGGAGCCCGAGGACGAGGAGG 361 

II I I I MM II II I II I I II M I MM II M 

Db 3925703 CGGCGCCGGTGGCGCCGGCGGTCGCGGCGCACTGCTGCTGGGCGCTGGCGGACAGGGCGG 
3925762 

Qy 3 62 ACGAGGAGGAGGAGGAGGACGAGGAGGAGGACGACGAGGACCTAGAGGAACTGGAGGTGC 421 

I II II I II MM Ml II I I I III I I 

Db 3925763 CCTCGGCGGCGCCGGCGGACAAGGCGGCACCGGCGGGGCCGGCGGAGATGGCGTTCTGGG 
3925822 



Qy 422 TGGAGAGGAAGCCCGCAGCCGGGCTGTCCGCAGCTGCGGTGCCGCCCGCCGCCGCCGCGC 4 81 



Db 3925823 GGGTGTCGGTGGCACTGGTGGTAAGGGCGGTGTCGGCGGCGTGGCTGGCCTCGGCGGGGC 
3925882 

Qy 4 82 CGCTGCTGGACTTCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGCCGGCCG 541 

II II 1 1 MM II I Ml MM II I MM II I I 

Db 3925883 CGGTGGTGCCGCGGGCCAGCTCTTCAGCGCCGGAGGCGCGGCGGGTGCCGTTGGGGTTGG 
3925942 

Qy 542 CGCCCCCTGCCGCTCCTGAGAGGCAGCCATCCTGGGAACGCAGCCCCGCGGCGCCCGCGC 601 

II I II MM II III I I III II I II Mill III 

Db 3925943 CGGCACCGGCGGCCAGGGTGGGGCTGGCGGTGCCGGAGCGGCCGGCGCCGACGCCCCCGC 
3926002 

Qy 6 02 CATCCCTGCCGCCCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAGAGGACGACGAGCCTC 661 

II I I I I III III III I I I I I I I 

Db 3926003 CAGCACAG GTCTAACCGGTGGTACCGGGTTCGCTGGCGGGGCCGGCGGCGTCGG 

3926056 

Qy 662 CGGCGAGGCCCCCGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGGCGGAGCCCGCCGCGC 721 

MM III I I III Ml IN II I Ml II III II II I 

Db 3926057 CGGCCAGGGCGGCAACGCCATTGCCGGCGGCATCAACGGCTCCGGTGGTGCCGGCGGCAC 
3926116 

Qy 722 CCCCTTCCACGCCGGCCGCGCC 743 

I I Ml I MM M I 

Db 3926117 CGG CGG CCAAGG CGGCG CCGG C 3 926138 



RESULT 15 
US-09-103-840A-1 

; Sequence 1, Application US/09103840A 

; Patent No. 6294328 

; GENERAL INFORMATION: 

; APPLICANT: FLEISCHMAN, Robert D. 

; APPLICANT: WHITE , Owen R. 

; APPLICANT: FRASER, Claire M. 

; APPLICANT: VENTER , John C. 

; TITLE OF INVENTION: DNA SEQUENCES FOR STRAIN ANALYSIS IN MYCOBACTERIUM 

; TITLE OF INVENTION: TUBERCULOSIS 

; FILE REFERENCE: 24366-20007.00 

; CURRENT APPLICATION NUMBER: US /09/103,84 0A 

; CURRENT FILING DATE: 1998-06-24 

; NUMBER OF SEQ ID NOS : 2 

; SOFTWARE: Patent In Ver. 2.1 

; SEQ ID NO 1 

LENGTH: 441152 9 

TYPE: DNA 

ORGANISM: Mycobacterium tuberculosis 
OTHER INFORMATION: H37Rv 
US-09-103-840A-1 

Query Match 1.8%; Score 68; DB 3; Length 4411529; 

Best Local Similarity 48.9%; Pred. No. 0.00029; 

Matches 216; Conservative 0; Mismatches 220; Indels . 6; Gaps 1; 



Qy 302 CGCCCCGGCCTCCGCCCGCCTTCAAGTACCAGTTCGTGACGGAGCCCGAGGACGAGGAGG 361 



II I I I Mil II II I II I I II II I 1 1 1 1 II M 

Db 3931915 CGG CG CCGGTGG CG C CGG CGGT CG CGG CG CACTG CTG CTGGGCG CTGG CGGACAGGG CGG 
3931974 

Qy 362 ACGAGGAGGAGGAGGAGGACGAGGAGGAGGACGACGAGGAC CTAGAGGAACTGGAGGTG C 421 

I II II I II I I I I III II III III I I 

Db 3931975 CCTCGGCGGCGCCGGCGGACAAGGCGGCACCGGCGGGGCCGGCGGAGATGGCGTTCTGGG 

3932034 

Qy 422 TGGAGAGGAAGCCCGCAGCCGGGCTGTCCGCAGCTGCGGTGCCGCCCGCCGCCGCCGCGC 4 81 

II I I I I II I I I I MM I II III I II I II 

Db 3932035 GGGTGTCGGTGGCACTGGTGGTAAGGGCGGTGTCGGCGGCGTGGCTGGCCTCGGCGGGGC 

3932094 

Qy 482 CGCTGCTGGACTTCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGCCGGCCG 541 

II II II 1 1 1 1 II I III 1 1 1 1 II I 1 1 1 1 II I I 

Db 3932095 CGGTGGTGCCGCGGGCCAGCTCTTCAGCGCCGGAGGCGCGGCGGGTGCCGTTGGGGTTGG 
3932154 

Qy 542 CGCCCCCTGCCGCTCCTGAGAGGCAGCCATCCTGGGAACGCAGCCCCGCGGCGCCCGCGC 601 

II I II II II II III I I Ml M I II Mill III 

Db 3932155 CGGCACCGGCGGCCAGGGTGGGGCTGGCGGTGCCGGAGCGGCCGGCGCCGACGCCCCCGC 
3932214 

Qy 602 CATCCCTGCCGCCCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAGAGGACGACGAGCCTC 661 

II I I I I I II I II III I I I I I I I 

Db 3932215 CAGCACAG GTCTAACCGGTGGTACCGGGTTCGCTGGCGGGGCCGGCGGCGTCGG 

3932268 

Qy 662 CGGCGAGGCCCCCGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGGCGGAGCCCGCCGCGC 721 

MM III I I III III II I I I II II II III II II I 

Db 3932269 CGGCCAGGGCGGCAACGCCATTGCCGGCGGCATCAACGGCTCCGGTGGTGCCGGCGGCAC 
3932328 

Qy 722 CCCCTTCCACGCCGGCCGCGCC 743 

I I III I MM II I 

Db 3932329 CGGCGGCCAAGGCGGCGCCGGC 3 93235 0 
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RESULT 1 

US-09-893-348-17 

; Sequence 17, Application US/09893348 
; Patent No. US20020072493A1 
; GENERAL INFORMATION: 

/ APPLICANT: EI SENBACH- SCHWARTZ , Michal 



APPLICANT: COHEN , I run R. 
APPLICANT: BESERMAN, Pierre 
APPLICANT: MOSONEGO, Alon 
APPLICANT: MOALEM, Gila 

TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EI S- SCHWARTZ =2A 
CURRENT APPLICATION NUMBER: US/09/8 93,348 
CURRENT FILING DATE: 2001-06-28 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98/14715 
PRIOR FILING DATE: 1998-07-21 
PRIOR APPLICATION NUMBER: IL 124500 
PRIOR FILING DATE: 1998-05-19 
NUMBER OF SEQ ID NOS: 29 
SOFTWARE: Patent In version 3.1 
SEQ ID NO 17 
LENGTH: 4684 
TYPE: DNA 

ORGANISM: Rattus norvegicus 
FEATURE : 
NAME/ KEY: CDS 
LOCATION: (253) . . (3744) 
OTHER INFORMATION: 
US-09-893-348-17 

Query Match 100.0%; Score 3739.4; DB 9; Length 4 684; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 3740; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy 1 ATTGCTCGTCTGGGCGGCGGCGGCGGCTGCAGCCTGGGACAGGGCGGGTGGCACATCTCG 60 

II MUM MUM MMM II MMM IMMMII MUM MUM 

Db 1 ATTG CTCGTCTGGG CGG CGG CGG CGGCTGCAG CCTGGGACAGGG CGGGTGG CACATCT CG 60 

Qy 61 ATCGCGAAGGCAGCAGAAGCAGTCTCATTGTTCCGGGAGCCGTCGCCTCTGCAGGTTCTT 12 0 

IMIIIIIIIIII IIIIIIIMIIIMIMIIIIIIIMMIIIIIIIIIIIIMIIII 

Db 61 ATCGCGAAGGCAGGAGAAGCAGTCTCATTGTTCCGGGAGCCGTCGCCTCTGCAGGTTCTT 12 0 

Qy 121 CGGCTCGGCTCGGCACGACTCGGCCTGCCTGGCCCCTGCCAGTCTTGCCCAACCCCCACA 180 

IMIIMM MMM IMMMII MMM IMMMMMIMM MMM 

Db 121 CGGCTCGGCTCGGCACGACTCGGCCTGCCTGGCCCCTGCCAGTCTTGCCCAACCCCCACjr 180 

Qy 181 ACCGCCCGCGACTCTGAGGAGAAGCGGCCCTGCGGCGGCTGTAGCTGCAGCATCGTCGGC 24 0 

Mill MMIMIMM llllllll IN INNINMIMMMIMM MINI!! 

Db 181 ACCGCCCGCGACTCTGAGGAGAAGCGGCCCTGCGGCGGCTGTAGCTGCAGCATCGTCGGC 24 0 

Qy 241 GACCCGCCAGCCATGGAAGACATAGACCAGTCGTCGCTGGTCTCCTCGTCCACGGACAGC 3 00 

IMM MMM IMMMII llllll MIMIIM llllll M IIIMM 

Db 241 GACCCGCCAGCCATGGAAGACATAGACCAGTCGTCGCTGGTCTCCTCGTCCACGGACAGC 3 00 

Qy 3 01 CCGCCCCGGCCTCCGCCCGCCTTCAAGTACCAGTTCGTGACGGAGCCCGAGGACGAGGAG 3 60 

Mill llllll MIMIIMMIIMI IMIIIMI MMII IIMM MM 

Db 3 01 CCGCCCCGGCCTCCGCCCGCCTTCAAGTACCAGTTCGTGACGGAGCCCGAGGACGAGGAG 3 60 



Qy 3 61 GACGAGGAGGAGGAGGAGGACGAGGAGGAGGACGACGAGGACCTAGAGGAACTGGAGGTG 420 

I Mill 1 1 1 1 1 ! 1 1 1 ; 1 1 ; 1 1 [ 1 1 1 1 1 ! I M i 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 361 GACGAGGAGGAGGAGGAGGACGAGGAGGAGGACGACGAGGACCTAGAGGAACTGGAGGTG 42 0 

Qy 421 CTGGAGAGGAAGCCCGCAGCCGGGCTGTCCGCAGCTGCGGTGCCGCCCGCCGCCGCCGCG 48 0 

1 1 1 1 i I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 . 1 : 1 1 : 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 

Db 421 CTGGAGAGGAAGCCCGCAGCCGGGCTGTCCGCAGCTGCGGTGCCGCCCGCCGCCGCCGCG 48 0 

Qy 481 CCGCTGCTGGACTTCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGCCGGCC 54 0 

1 1 1 1 1 1 I i 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 Ml Mill; 1 1 1 1 : 1 1 1 1 1 1 1 lit III 1 1 

Db 4 81 CCGCTGCTGGACTTCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGCCGGCC 54 0 

Qy 541 GCGCCCCCTGCCGCTCCTGAGAGGCAGCCATCCTGGGAACGCAGCGCCGCGGCGCCCGCG 600 

Mill II IIMIMIMI MMIM MIMI MIMI MIMIMMM 

Db 541 GCGCCCCCG CCGCTCCTGAGAGGCAGCCATCCTGGGAACGCAGCCCCGCGGCGCCCGCG 600 

Qy 601 CCATCCCTGCCGCCCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAGAGGACGACGAGCCT 660 

IIIIIIIIIMIIMIMIIIIIillllllMIIIIIIIIIIIIIIIIIIMIIIIIIII 

Db 601 CCATCCCTGCCGCCCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAGAGGACGACGAGCCT 660 

Qy 661 CCGGCGAGGCCCCCGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGGCGGAGCCCGCCGCG 72 0 

iiiii i! iiiiii iiiiii Mini ii iiiiM nMiiiii iiiiii i 

Db 661 CCGGCGAGGCCCCCGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGGCGGAGCCCGCCGCG 72 0 

Qy 721 CCCCCTTCCACGCCGGCCGCGCCCAAGCGCAGGGGCTCCGGCTCAGTGGATGAGACCCTT 78 0 

Mill ll MIMI MUM MIMI II IMIII IMIMI IMMIMM 

Db 721 CCCCCTTCCACGCCGGCCGCGCCCAAGCGCAGGGGCTCCGGCTCAGTGGATGAGACCCTT 78 0 

Qy 781 TTTGCTCTTCCTGCTGCATCTGAGCCTGTGATACCCTCCTCTGCAGAAAAAATTATGGAT 84 0 

IMMMUIMM MMMMIMI MMIMMMIMI IIMIMM 

781 TTTGCTCTTCCTGCTGCATCTGAGCCTGTGATACCCTCCTCTGCAGAAAAAATTATGGAT 84 0 

Qf 841 TTGATGGAGCAGCCAGGTAAC7VCTGTTTCGTCTGGTCAAGAGGATTTCCCATCTGTCCTG 900 

I I I I I 1 I I ! I I I I I I I I I I I I I I I I I II I I I I I ! I I I I I I I I I I I I I I I I I I I I I II I 1 I 

Db 841 TTGATGGAGCAGCCAGGTAACACTGTTTCGTCTGGTCAAGAGGATTTCCCATCTGTCCTG 900 

Qy 901 CTTGAAACTGCTGCCTCTCTTCCTTCTCTATCTCCTCTCTCAACTGTTTCTTTTAAAGAA 960 

IMIMM IIIIII IMIII MUM II MIMI MMIM MIMI M 

Db 901 CTTGAAACTGCTGCCTCTCTTCCTTCTCTATCTCCTCTCTCAACTGTTTCTTTTAAAGAA 960 

Qy 961 CATGGATACCTTGGTAACTTATCAGCAGTC 102 0 

MlllMi IIIIII llllli IIIIII ll-IMIII I IIIIM IIIMIil 

Db 961 CATGGATACCTTGGTAACTTATCAGCAGTGTCATCCTCAGAAGGAACAATTGAAGAAACT 102 0 

Qy 1021 TTAAATGAAG CTTCTAAAGAGTTG CCAGAGAGGG CAACAAATCCATTTGTAAATAGAGAT 108 0 

Mill II MIMI IIIIII IIIIII M MUM MMIIMI IIMIMM 

Db 1021 TTAAATGAAG CTTCTAAAGAGTTG CCAGAGAGGG CAACAAATC CATTTGTAAATAGAGAT 108 0 

Qy 1081 TTAGCAGAATTTTCAGAATTAGAATATTCAGAAATGGGATCATCTTTTAAAGGCTCCCCA 114 0 

i ! 1 1 < i - ! I i I , I I I I I ! I ; I I j I I I [ . I I I I ! I III MINI I I 

Db 1081 TTAGCAGAATTTTCAGAATTAGAATATTCAGAAATGGGATCATCTTTTAAAGGCTCCCCA 114 0 

Qy 1141 AAAGGAGAGTCAGCCATATTAGTAGAAAACACTAAGGAAGAAGTAATTGTGAGGAGTAAA 1200 

I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1141 AAAGGAGAGTCAGCCATATTAGTAGAAAACACTAAGGAAGAAGTAATTGTGAGGAGTAAA 1200 



QY 



12 01 GACAAAGAGGATTTAGTTTGTAGTGCAGCCCTTCACAGTCCACAAGAATCACCTGTGGGT 126 0 



IIIIIIIIIIMIIIIIIIIMIMIIIIIIIIIIMIIIIIIIIIMIIIIIIIIIIII 

Db 1201 GACAAAGAGGATTTAGTTTGTAGTGC^ 1260 

Qy 1261 AAAGAAGACAGAGTTGTGTCTCCAGAAAAGACAATGGACATTTTTAATGAAATGCAGATG 1320 

I III IM III MIMIIMm MMMMMMMMM III II! Ml I 

Db 1261 AAAGAAGACAGAGTTGTGTCTCCAGAAAAGACAATGGACATTTTTAATGAAATGCAGATG 132 0 

Qy 1321 TCAGTAGTAG CACCTGTGAGGGAAGAGTATG CAGACTTTAAG CCATTTGAACAAGCATGG 138 0 

M III III II! IMIMI Ml lillllllllllll Ml IMMM IMMI 

Db 1321 T CAGTAGTAG CAC CTGTGAGGGAAGAGTATG CAGACTTTAAG CCATTTGAACAAGCATGG 138 0 

Qy 1381 GAAGTGAAAGATACTTATGAGGGAAGTAGGGATGTGCTGGCTGCTAGAGCTAATGTGGAA 144 0 

MM! Ill MIMMMIM III IMMMMMMM IM !M Ml III I 

Db 1381 GAAGTGAAAGATACTTATGAGGGAAGTAGGGATGTGCTGGCTG CTAGAG CTAATGTGGAA 144 0 

Qy 1441 ACAGAAAATGCTTGGAAGATAGCCTGGAGCAAAAAAGTCTTGGGAAGGAT 1500 

1 1 ill 1 1 ; 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 - 1 ^. i . 1 1 ! , i 

Db 1441 AGTAAAGTGGACAGAAAATGCTTGGAAGATAGCCTGGAGCAAAAAAGTCTTGGGAAGGAT 1500 

Qy 15 01 AGTGAAGGCAGAAATGAGGATG CTTCTTT C CCCAGTAC C CCAGAAC CTGTGAAGGACAGC 1560 

I MM MM MM MM MM IMIMI M' Ml IM III Ml Ml MM II 

Db 1501 AGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAACCTGTGAAGGACAGC 1560 

Qy 1561 TCCAGAGCATATATTACCTGTGCTTCCTTTACCTCAGCAACCGAAAGCACCACAGCAAAC 162 0 

MMMMMMMIMMIMM ill III IM IMMM III MIMIMI 

Db 1561 TCCAGAGCATATATTACCTGTGCTTCCTTTACCTCAGCAACCGAAAGCACCACAGCAAA 1620 

Qy 1621 ACTTTCCCTTTGTTAGAAGATCATACTTCAGAAAATAAAACAGATGAAAAAAAAATAGAA 168 0 

MMMIIMM nl lllllll Ml IMMM IM IM MMMMIMM 

Db 1621 ACTTT CC CTTTGTTAGAAGAT CATACTTCAGAAAATAAAACAGATGAAAAAAC C TAGAA 168 0 

Qy 1681 GAAAGGAAGG CCCAAATTATAACAGAGAAGACTAG CCC CAAAACGT CAAATCCTTT CCTT 174 0 

MIMMIMM IM IMMM Ml IM Ml III III MMIIIIIIIII 

Db 1681 GAAAGGAAGGCCCAAATTATAACAGAGAAGACTAGCCCCAAAACGTCAAATCCTTTCCTT 174 0 

Qy 1741 GTAGCA,GTACAGGATTCTGAGGCAGATTATGTTACAAC^GATACCTTATCAAAGGTGACT 1800 

IMMMMMMM IMMM M Ml IMMM III IMMI IMMM 

Db 1741 GTAGCAGTACAGGATTCTGAGGCAGATTATGTTACAACAGATACCTTATCAAAGGTGACT 180 0 

Qy 1801 GAGGCAGCAGTGTCAAACATGCCTGAAGGTCTGACGCCAG 186 0 

MMIMMMI III Ml III IM III IM MMMMMMMMMMM 

Db 1801 GAGGCAGCAGTGTCAAACATGCCTGAAGGTCTGACGCCAGATTTAGTTCAGGAAGCATGT 186 0 

Qy 1861 GAAAGTGAACTGAATGAAGCCACAGGTAC7VAAGATTGCTTATGAAACAAAAGTGGACTTG 192 0 

lllllllillll III Ml III III III III MMMMMMMMMMM 

Db 1861 GAAAGTGAACTGAATGAAGCCACATTTACAAAGATTGCTTATGAAACAAAAGTGGACTTG 192 0 

Qy 1921 GTCCAAACATCAGAAGCTATACAAGAATCACTTTACCCCACAGCA.C^^ 198 0 

IIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIMIMIIIIIIIMIMI 

Db 1921 GTCCAAACATCAGAAGCTATACAAGAATCACTTTACCCCA 198 0 

Qy 1981 TTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATGGAAGCACCA 2 04 0 

MlMMIMM III IM'MIM M !M IMMM III IMIIIIIMIII 

Db 1981 TTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATGGAAGCACCA 2 04 0 

Qy 2041 TTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTATCCCCACTG 2100 

1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 ; I i 1 1 I i M I 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 



Db 



2 041 TTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTATCCCCACTG 210 0 



Qy 2101 GAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCTTGAGCCTGAAAACCCCCCACCA 2160 

1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 ; 1 1 1 1 ! I i i 1 1 1 1 1 ! 1 1 1 ! 1 1 1 1 i I 1 1 1 1 1 1 1 1 I ! ! I 

Db 2101 GAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCTTGAGCCTGAAAACCCCCCACCA 216 0 

Qy 2161 TATGAAGAAGCCATGAATGTAGC^CTAAAAGCTTTGGGAAC^W^GGAAGGAATAAAAGAG 222 0 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIMIIIIIIIIIIIII 

Db 2161 TATGAAGAAGCCATGAATGTAGCACTAAAAGCTTTGGGAACAAAGGAAGGAATAAAAGAG 222 0 

Qy 2221 CCTGAAAGTTTTAATGCAGCTGTTCAGGAAACAGAAGCTCCTTATATATCCATTGCGTGT 228 0 

MMMMMMMMMIMM MM lllllllll MIMMII IMMIII 

Db 2221 CCTGAAAGTTTTAATGCAGCTGTTCAGGAAACAGAAGCTCCTTATATATCCATTGCGTGT 228 0 

Qy 22 81 GATTTAATTAAAGAAACAAAGCTCTCCACTGAGCCAAGTCCAGATTTCTCTAATTATTCA 234 0 

' M 1 1 1 1 1 M 1 1 1 M I III M 1 1 1 1 1 MM 1 1 1 Ml 1 1 M Ml I Ml M M 1 1 1 1 1 12430 

Db 22 81 GATTTAATTAAAGAAACAAAGCTCTCCACTGAGCCAAGTCCAGATTTCTCTAATTATTCA 234 0 

Qy 2341 GAAATAGCAAAATTCGAGAAGTCGGTGCCCGAACACGCTGAGCTAGTGGAGGATTCCTCA 24 00 

IMIMM IMIIMM Mill Mill MIMIIIIIIIIIIMMIIIIIIIM 

Db 2341 GAAATAGCAAAATTCGAGAAGTCGGTGCCCGAACACGCTGAGCTAGTGGAGGATTCCTCA 24 00 

Qy 24 01 CCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAAGTCCCACAAACA 24 6 0 

1 1 1 : 1 1 1 ; 1 1 i ; 1 1 1 1 : 1 1 1 : ; 1 1 1 I i I S I i I ; i 1 1 1 1 S 1 1 1 1 1 1 1 i 1 1 1 1 1 i : i 

Db 24 01 CCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAAGTCCCACAAACA 24 60 

Qy 24 61 CAAGAGGAGG CTGTGATG CTCATGAAGGAGAGTCTCACTGAAGTGTCTGAGACAGTAG C C 252 0 

II: MM IMMIMI MM ;MI MM MMMMMMMMMIIMM 

Db 24 61 CAAGAGGAGG CTGTGATG CT CATGAAGGAGAGTCT CACTGAAGTGTCTGAGACAGTAG C C 252 0 

Qy 2521 CAGCACAAAGAGGAGAGACTTAGTGCCTCACCTCAGGAGCTAGGAA ATASA® 258 8 

IM III! IMMIIM IIMMMi MMMMIMMMMMMIMIMM 

Db 2 521 CAGCACAAAGAGGAGAGACTTAGTGCCTCACCTCAGGAGCTAGGAAAGCCATATTTAGAG 258 0 

Qy 2 581 TCTTTTCAGCCCAATTTACATAGTACAAAAG 264 0 

IMIMM IIMMMI MIM Ml IMMIIMIIIIMIIIIIIIIIIIMM 

Db 2 581 TCTTTTC^GCCCAATTTACATAGTACAAAAGATGCTGCATCTAATGACATTCCAACATTG 264 0 

Qy 2641 ACCAAAAAGGAGAAAATTTCTTTGCAAATGGAAGAGTTTAATACTGCAATTTATTCAAAT 270 0 

M 1 1 M 1 1 1 1 1 1 II 1 1 MM 1 1 1 1 1 1 MMI I II M 1 1 1 1 1 1 1 1 MM 1 1 1 1 1 II II 

Db 2641 ACCAAAAAGGAGAAAATTTCTTTGCAAATGGAAGAGTTTAATACTGCAATTTATTCAAAT 2700 

Qy 27 01 GATGACTTACTTTCTTCTAAGGAAGACAAAATAAAAGAAAGTGAAAQ^T^ 276 0 

II lllllll Mill irilllllllll MINI III lllllilllll III IIMIIII 

Db 2701 GATGACTTACTTTCTTCTAAGGAAGACAAAATAAAAGAAAGTGAAACATTTTCAGATTCA 2760 

Qy 2 761 TCTCCGATTGAGATAATAGATGAATTTCCCACGTTTGTCAGTGCTAAAGATGATTCTCCT 282 0 

1 1 1 1 1 1 1 1 1 1 J 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 f i 1 1 1 1 

Db 2 761 TCTCCGATTGAGATAATAGATGAATTTCCCACGTTTGTCAGTGCTAAAGATGATTCTCCT 282 0 

Qy 2821 AAATTAGCCAAGGAGTACACTGATCTAGAAGTATCCGACAAAAGTGAAATTGCTAATATC 2880 

IMIMM llllllil MIMMIMI llllll MIMIMMMIMI lllllllll 

Db 2821 AAATTAGCCAAGGAGTACACTGATCTAGAAGTATCCGACAAAAGTGAAATTGCTAATATC 288 0 

Qy 2 881 CAAAGCGGGGCAGATTCATTGCCTTGCTTAGAATTGCCCTGTGACCTTTCTTTCAAGAAT 2 94 0 

III MIMMMMIM MM MM IMMIIM IMIMIII MM MM 

Db 28 81 CAAAGCGGGGCAGATTCATTGCCTTGCTTAGAATTGCCCTGTGACCTTTCTTTCAAGAAT 2 94 0 



Qy 2 941 ATATATCCTAAAGATGAAGTACATGTTTCAGATGAATTCTCCGAAAATAGGTCCAGTGTA 3 000 

1 1 1 : 1 1 1 1 1 1 I ! 1 1 ' Ml 1 1 1 . 1 1 1 1 1 1 ' 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 2 941 ATATATCCTAAAGATGAAGTACATGTTTCAGATGAATTCTCCGAAAATAGGTCCAGTGTA 3 000 

Qy 3 001 TCTAAGGCATCCATATCGCCTTCAAATGTCTCTGCTTTGGAACCTCAGACAGAAATGGGC 3 060 

IMIIIIIIIIIIIIIIIMMIIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 3 001 TCTAAGGCATCCATATCGCCTTCAAATGTCTCTGCTTTGGAACCTCAGACAGAAATGGGC 3 060 

Qy 3 061 AGCATAGTTAAATCCAAATCACTTACGAAAGAAGCAGAGAAAAAACTTCCTTCTGACACA 3120 

IMMIMI IMIM II IMIM III III MM III IMIM lllllll 

Db 3 061 AGCATAGTTAAATCCAAATCACTTACGAAAGAAGCAGAGAAAAAACTTCCTTCTGACACA 3120 

Qy 3121 GAGAAAGAGGACAGATCCCTGT CAG CTGTATTGTCAG CAGAG CTGAGTAAAACTTCAGTT 318 0 

IMMMII IMIM III MMM IMMIIMI IM MMM lllllll 

Db 3121 GAGAAAGAGGACAGAT CC CTGT CAG CTGTATTGTCAG CAGAG CTGAGTAAAACTTCAGTT 318 0 

Qy 3181 GTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTA 324 0 

IMMMII MM I Ml MMM MIMIIMI IM IMIM lllllll 

Db 3181 GTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGCTTA 324 0 

Qy 3241 TTCCTGCTGCTGTCTCTGAt^GTGTTCAGCATTGTCAGTGTAACGGCCTACATTGCCTTG 33 00 

IMMIMI IMIM I! MMM IIIMMIII III II MMMIIMI 

Db 3241 TTCCTGCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACATTGCCTTG 33 00 

Qy 33 01 GCCCTGCTCTCGGTGACTATCAGCTTTAGGATATATAAGGGCGTGATCCAGGCTATCCAG 33 60 

IMMIMI I MM M IMIM IIIMMIII III II III lllllll 

Db 33 01 GCCCTGCTCTCGGTGACTATCAGCTTTAGGATATATAAGGGCGTGATCCAGGCTATCCAG 33 60 

Qy 3361 AAAT(^GATGAAGGCCACCC^TTCAGGGCATATTTAGAATCTGAAGTTGCTATATCAGAG 3420 

IMMIMI I MM II llllll MIIIIMM III II III lllllll 

Db 3361 AAATCAGATGAAGGCCACCCATTCAGGGCATATTTAGAATCTGAAGTTGCTATAT CAGAG 3420 

Qy 3421 GAATTGGTTCAGAAATACAGTAATTCTGCTCTTGGTC^TGTGAACAG(^CAATAAAAG^ 34 80 

MIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIMIIIIIIIIIIIIIMMIIIIIIII 

Db 3421 GAATTGGTTC^GAAATACAGTAATTCTGCTCTTGGTC^TGTGAACAGCAC^TAAAAG^ 3480 

Qy 3481 CTGAGGCGGCTTTTCTTAGTTGATGATTTAGTTGATTCCCTGAAGTTTGCAGTGTTGATG 354 0 

IMMIMI Ml MMM IIIIMMM IMIM Ml MUM IIIMII 

Db 3481 CTGAGGCGGCTTTTCTTAGTTGATGATTTAGTTGATTCCCTGAAGTTTGCAGTGTTGATG 354 0 

Qy 3541 TGGGTGTTTACTTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTGATTTTAGCTCTG 3600 

llllllll IMM Ih IMM IIIIMMIIliM IMIIIIIIII IIIMIIIIIIII 

Db 3541 TGGGTGTTTACTTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTGATTTTAGCTCTG 3 600 

Qy 3601 ATCTCA.CTCTTCAGTATTCCTGTTATTTATGAACGGCATCAGGTGCAGATAGATCATTAT 3 660 

I lllllll IMMIII|075X2||||| Ml IMIIIMIilMI IMM Ml IIIMII 385 
Db 3601 ATCTC^CTCTTCAGTATTCCTGTTATTTATGAACGGCATCAGGTGCAGATAGATCATTAT 3660 

Qy 3661 CTAGGACTTGCAAACAAGAGTGTTAAGGATGCCATGGCCAAA^ 3720 

III MMM III MMM IIMMIMI IM llllll IMIIIMIIMI 

Db 3661 CTAGGACTTGCAAACAAGAGTGTTAAGGATGCCATGGCCAi^ 3720 

Qy 3721 GGATTGAAGCG CAAAG CAGAT 3 741 

Ml IM I:' IM IMM 

Db 3721 GGATTGAAG CG CAAAG CAGAT 3741 
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Qy 134 CACGACTCGGCCTGCCTGGCCCCTGC(^GTCTTGCC(^CCCCCAC^ACCGCCCGCGACT 193 

I II III I I III MINI I I II I I IIIIIIIIIIIIIIIIIMIIII II 

Db 16 CTCGGCTC^GTCGGCCt^GCCCCTCTCAGTCCTCCCC^CCCCavCAACCGCCCGCGGCT 75 

Qy 194 CTGAGGAGAAGCGGC - CCTGCGGCGGCTGTAGCTGCAGCATCGTCGGCGACCCGCCAGCC 252 

IIIII IIIII Mini I II :M II II I 1 1 1 1 MINI 

Db 76 CTGAGACGCGGCCCCGGCGGCGGCGGCAGCAGCTGCAGCATCATC - TCCACCCTCCAGCC 134 

Qy 253 ATGGAAGACATAGACCAGTCGTCGCTGGTCTCCTCGTCCACGGACAGCCCGCCCCGGCCT 312 

I I I I I I I I I I I IWtWT | | | | | | | | | | M I I I I I I I I I I I I 

Db 13 5 ATGGAAGACCTGGACCAGTCTCCTCTGGT CTCGTCCTCGGACAGCCCACCCCGGCCG 191 

Qy 313 CCGCCCGCCTTCAAGTACCAGTTCGTGACGGAGCCCGAGGACGAGGAGGACGAGGAGGAG 372 

I Ml I 1 1 1 M 1 1 1 1 1 1 ; 1 1 1 1 1 II III IIIMM II II II Ml 

Db 192 CAGCCCGCGTTCAAGTACCAGTTCGTGAGGGAGCCCGAGGACGAGGAG GAAGAAGAG 248 

Qy 373 GAGGAGGACGAGGAGGAGGACGACGAGGAC CTAGAGGAACTGGAGGTG CTGGAGAGGAAG 432 

IMIIMI IMMIII II Mill IIIII Mill i 

Db 24 9 GAGGAGGAAGAGGAGGACGAGGACGAAGACCTGGAGGAGCTGGAGGTGCTGGAGAGGAAG 3 08 



Qy 



433 CCCGCAGCCGGGCTGTCCGCAGCTGCGGTGC CGCCCGCCGCCGCCGCGCCGCTG 486 



Mill Mill MINIM II I MM I II IMIMI MM III 

Db 3 09 CCCGCCGCCGGGCTGTCCGCGGCCCCAGTGCCCACCGCCCCTGCCGCCGGCGCGCCCCTG 368 

Qy 4 87 CTGGACTTCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGCCGGCCGCGCCC 54 6 

MUM I I MM Mill II MINI! II 1 1 MIIMIMM III 

Db 369 ATGGACTTCGGAAATGACTTCGTGCCGCCGGCGCCCCGGGGACCCCTGCCGGCCGCTCCC 42 8 
Qy 54 7 CCTGCCGCTCCTGAGAGGCAGCCATCCTGGGAACGCAGCCCCG CGGCGCCC 597 

II I III II Ml IMIMI II Mill I Ml I I I MM 

Db 42 9 CCCGTCGCCCCGGAGCGGCAGCCGTCTTGGGACCCGAGCCCGGTGTCGTCGACCGTGCCC 48 8 

Qy 598 GCGCCATCCCTGCCGCCCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAGAGGACGACGAG 657 

MIIIIMM II I I M MM III 1 1 1 1 1 i 1 1 i i ; 1 1 IMIMI I 

Db 48 9 GCGCCATCCCCGCTGTCTGCTGCCGCAGTCTCGCCCTCCAAGCTCCCTGAGGACGACGAG 54 8 
Qy 9 6BKGCCTCC50X AGGCCCCCGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGGCGGAG 711 

IIIMIII MM II Mill II M III III IMIMII III III 

Db 54 9 CCTCCGGCCCGGCCTCCCCCTCCTCCCCCGGCCAGCGTGAGCCCCCAGGCAGAGCCCGTG 608 
Qy 712 CCCGCCGCGCCCCCTTCCACGCCGGCCGCGCCCAAGCGC 750 

III IMIMI Mill MIMMMMMIMM 

Db 609 TGGACCCCGCCAGCCCCGGCTCCCGCCGCGCCCCCCTCCACCCCGGCCGCGCCCAAGCGC 668 
Qy 751 AGGGGCTCC GGCTGAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCT 8 07 

M Mill MIIIMI MM IMIMII IMIMM MM MIMIII I 

Db 669 AGGGGCTCCTCGGGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCT 72 8 

Qy 8 08 GTGATACCCTCCTCTGCAGAAAAAATTATGGATTTGATGGAGCAGCCAGGTAACACTGTT 867 

IMIMI II IMIMII I 1 1 II 1 1 MM MMMIMIMMIMM II 

Db ag2 9 GTGATACG CT C CT CTG CAGAAAA TATGGACTTGAAGGAGCAGCCAGGTAACAC 0 XrT 78 5 

Qy 8 68 TCGTCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCCTCTCTTCCTTCT 927 

III 1 1 1 1 . 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : i 1 1 1 1 1 i lllllillllll 

Db 78 6 TCGGCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCTTCTCTTCCTTCT 845 

Qy 92 8 CTATCTCCTCTCTCAACTGTTTCTTTTAAAGAACATGGATACCTTGGTAACTTATCAGCA 98 7 

II IMIMI II I I MIMI IIMMIMI III III; M Ml II 

Db 846 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 905 

Qy 988 GTGTCATCCTCAGAAGGAACAATTGAAGAAACTTTAAATGAAGCTTCTAAAGAGTTGCCA 1047 

IMIMI IMIMII II MIMI I I I 1 1 1 1 f I i I i 1 1 i 1 1 1 i I II 

Db 9 06 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 965 

Qy 1048 GAGAGGGCAACAAATCCATTTGTAAATAGAGATTTAGCAGAATTTTCAGAATTAGAATAT 1107 

MM Mill II II I I II MIIMIMM MM 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 

Db 966 GAGAAGGCAAAAACTCTACTCATAGATAGAGA CAGAGTTTTCA^AAESAGAATAC 1025 

Qy 1108 TCAGAAATGGGATCATCTTTTAAAGGCTCCCCAAAAGGAGAGTCAGCCATATTAGTAGAA 1167 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I I Ml IMIMI Ml II Ml II MIMI I 

Db 1026 TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 1085 
Qy 1168 AACACTAAGGAAGAAGTAATTGTGAGGAGTAAA GACAAAGAGGATTTAGTTTGTAGT 1224 

II III MIMM MM MM I MM II Mill I MIMI Ml I 

Db 108 6 AAT CCTAGGGAAGAAATAAT CGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 114 5 
Qy 1225 GCAGCCCTTCACAGTCCACAAGAATCACCT GTGGGTAAAGAAGAC 12 69 

MIMI I II MIMI I MM III MIMI II 



Db 



114 6 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1205 



Qy 1270 AGAGTTGTGTCTCCAGAAAAGACAATGGACATTTTTAATGAAATGCAGATGTCAGTAGTA 132 9 

Illlllllll lllllll III I I I I IIIIIIIMII I I I I I I I I 

Db 12 06 GAAGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTGGAA 1265 

Qy 1330 G CAC CTGTGAGGGAAGAGTATG CAGACTTTAAG CCATTTGAACAAGCATGGGAAGTGAAA 138 9 

II 1 1 1 lllllll II IIIIIIIMII M Illlllll I II MIMMMMM 

Db 1266 G CTCCTATGAGGGAGGAATATG CAGACTTCAAACCATTTGAG CGAGTATGGGAAGTGAAA 1325 

Qy 1390 GATACTTATGAGGGAAGTAGGGATGTG CTGG CTGCTAGAG CT AATGTG 1437 

MM I III I Ml Ml II Illlllll III I II II 

Db 1326 GATA - - - GTAAGGAAGATAGTGATATGTTGGCTGCTGGAGGTAAAATCGAGAGCAACTTG 1382 

Qy 1438 GAAAGTAAAGTGGACAGAAAATGCTTGGAAGATAGCCTGGAGCAAAAAAGTCTTGGGAAG 14 97 

|| llllllllllll | IMIM II I I Illlllll MUM |ur<H4culdO| I 
Db 13 83 GAAAGTAAAGTGGATAAAAAATGTTTTGCAGATAGCCTTGAGCAAACTAATCACGAAAAA 1442 

Qy 14 98 GATAGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAACCTGTGAAGGAC 1557 

Illlllll I II Illll Ml II MINIMI MUM I I Illll 

Db 1443 GATAGTGAGAGTAGTAATGATGATACTTCTTTCCCCAGTACGCCAGAAGGTATAAAGGAT 1502 
Qy 1558 AGCTCCAGAGCATATATTACCTGTGCTTCCTTTA CCTCAGCAACCGAAAGCACCACA 1614 

I II MIMIIMI II IMIM IIIIII I lllllll II MM M 

Db 1503 CGTTCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATTGCA 1562 

Qy 1615 GCAAACACTTTCCCTTTGTTAGAAGATCATACTTCAGAAAATAAAACAGATGAAAAAAAA 1674 

IIIIII III IIMMIMI Illll lllllllllllllll M IIIIIIIIIIM 

Db 1563 ACAAACATTTTTCCTTTGTTAGGAGATCGTACTTCAGAAAATAAGACGGATGAAAAAAAA 1622 

Qy 1675 ATAGAAGAAAGGAAGGCCCAAATTATAACAGAGAAG- ACTAGCCCCAAAACGTCAAAT 1731 

Illlllllll llllllllllll lllllll IIIIII IIMIII Mill 

Db 1623 ATAG AAGAAAAGAAGG C C CAAATAG TAA CAGAGAAGAATA C TAG CA C CAAAACAT CAAA C 1682 

Qy 1732 CCTTTCCTTGTAGCAGTACAGGATTCTGAGGCAGATTATGTTACAACAGATACCTTATCA 1791 

Illll MIMIIMI MIMMMMM 1 1 1 1 II 1 1 i I lllllll III II 

Db 1683 CCTTTTCTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAACAGATAATTTAACA 1742 

Qy 1792 AAGGTGACTGAGGCAGCAGTGTCAAACATGCCTGAAGGTCTGAGGCCAGATTTAGTTCAG 1851 

MMMMIMM II Ml III Ml IIIIII Mill MIMIIMI! Ill 

Db 1743 AAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTACAG 18 02 

Qy 18 52 GAAGCATGTGAAAGTGAACTGAATGAAGCCACAGGTACAAAGATTGCTTATGAAACAAAA 1911 

II Ml IIIIII II 1 1 1 M 1 1 1 1 I Ml Ml IIIIII Ml MINI I 

Db 18 03 GAAG CATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTG CTTATGAAACAAAA 18 62 

Qy 1912 GTGGA TCCAAACATCAGAAGCTATACAAGAATCACTTTACCCCACAGCACAGCTT 1971 

I ! 1 1 1 1 1 1 1 MMMMIMM III Mill Illll II M M IMIM I 

Db 1863 ATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAGCTT 1922 

Qy 1972 TGCCCATCATTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATG 2 031 

MIMIIMI II I Illll Mill IIIMIIIIIIMIIII IMIM 

Db 1923 TGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTTATG 1982 

Qy 2 032 GAAGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTA 2 091 

MMMIIMI MMM MM M Illlllll II I IIIIII I 

Db 1983 GAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGCTCA 2 042 



Qy 2 092 TCCCCACTGGAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCTTGAGCCTGAAAAC 2151 

II III I I I I I II II llllll llllll II Mill I lllllllllllll 

Db 2 043 TCACCATTAGAAG CTTCTTCAGTTAATTATGAAAGCATAAAACATGAGCCTGAAAAC 209 9 

Qy 2152 CC CCCACCATATGAAGAAG C CATGAATGTAG CACT AAAAG CTTTGGGAACAAAGGAA 2208 

MINI II Mill lllllll III! 1 1 1 1 INN I MM llllll 

Db 2100 CCCCCACCATATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATCAGGAATAAAGGAA 215 9 

Qy 22 09 GGAATAAAAGAGCCTGAAAGTTTTAATGCAGCTGTTCAGGAAACAGAAGCTCCTTATATA 2268 

I III II IIMM M I lllllllllll MM 1 1 1 1 ! 1 1 1 1 1 1 1 i 1 1 E 1 1 1 1 ] 

Db 2160 GAAATTAAAGAG C CTGAAAATATTAATG CAG CTCTTCAAGAAACAGAAGCTCCTTATATA 2219 

Qy 2269 TCCATTGCGTGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGCCAAGTCCAGATTTC 2328 

II Mill MIMMIMM MIIIIMM II MM III III Ml 

Db 2220 TCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGATTTC 2279 

Qy 2329 TCTAATTATTCAGAAATAG CAAAATT CGAGAAGTCGGTG C C CGAACACGCTGAG CTAGTG 238 8 

III MM Mill llllll I II III Mill II II I 1 1 1 1 : 1 1 1 

Db 22 80 TCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGCTAGTT 233 9 

Qy 2389 GAGGATTCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAA 244 8 

II llllllllll MUM IIMM MMM II Mill II Mill 

Db 234 0 GAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCTGAC 239 9 

Qy 244 9 GTCCCACAAACACAAGAGGAGGCTGTGATGCTCATGAAGGAGAGTCTCACTGA A 2502 

M lllllll IIMM II III MM MM II IMMIIMM I 

Db 24 00 GTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACTTCA 245 9 

Qy 2503 GTGTCTGAGACAGTAGCCCAGCACAAAGAGGAGAGACTTAGTGCCTCACCTCAGGAGCTA 2562 

I llllll II MM I III Mill I II I III I 

Db 24 60 TTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTTGCCACCTGAGGGA 2519 

Qy 2563 GGAAAGCCATATTTAGAGTCTTTTCAGCCCAATTTACATAGTACAAAAGATGC TGCA 2619 

I llllll Ml M llllll III M MM III MMM I I 

Db 252 0 GGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAAAGATACCCTGTTA 257 9 

Qy 262 0 TCTAATGACATTCCAACATTGACCAAA7UVGGAGAAAATTTCTTTGCAAATGGAAGAGTTT 267 9 

II 1 1 1 1 II lllllllll llllllllllllllll lllllll Mill III I 

Db 258 0 CCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCAGATGGAGGAGCTC 263 9 

Qy 2680 AATACTG CAATTTATTCAAATGATGACTTACTTTCTT CTAAGGAAGACAAAATAAAAGAA 273 9 

I MMM Mill M MMM M II IMMIIMM I MM MM 

Db 2 64 0 AGTACTG CAGTTTATTCAAATGATGACTTATTTATTT CTAAGGAAG CACAGATAAGAGAA 2699 

Qy 2 74 0 AGTGAAACATTTTCAGATTCATCTCCGATTGAGATAATAGATGAATTTCCCACGTTTGTC 279 9 

I IIIIa MM MM II MM II Mill II II II III I II II II II 4t 

Db 27 00 ACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACATTGATC 275 9 

Qy 28 00 AGTGCTAAAGATGATTC T CCTAAATTAG CCAAGGAGTACACTGATCTAGAAGTATCC 2856 

III Mill llllll I IMMMIMM III II Mill I IIMM M 

Db 2760 AGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAAGTATCC 2819 

Qy 28 57 GACAAAAGTGAAATTGCTAATATCCAAAGCGGGGCAGATTCATTGCCTTGCTTAGAATTG 2916 

IIIIIIIIMIIIIIIMM II II II I MMM ! lllllll 

Db 282 0 CACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACAGAATTG 287 9 



Qy 2 917 CCCTGTGAC CTTT CTTTCAAGAATATATATCCTAAAGATGAAG TACATGTTTCA 297 0 

Ml IIIMIIIIIII lllll III I II MM lllll I I I III 

Db 2880 CCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGTTTCTCA 293 9 

Qy 2 971 GATGAATTCTCCGAAAATAGGTCCAGTGTATCTAAGGCATCCATATCGCCTTOVAATGTC 303 0 

Mill II II lllll MM I Ml MM I III MM II MM 

Db 2 94 0 GATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCAGATGTT 2 99 9 

Qy 3 031 TCTGCTTTGGAACCTCAGACAGAAATGGGCAGCATAGTTAAATCCAAATCACTTACGAAA 309 0 

1 1 1 1 1 1 1 1 1 1 MM MM II I ! 1 1 1 1 1 1 1 1 1 1 i Mill Ml MM 

Db 300 0 TCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTTGTGAAA 305 9 

Qy 3 091 GAAGCAGAGAAAAAACTTCCTTCTGACACAGAGAAAGAGGACAGATCCCTGTCAGCTGTA 3150 

Mill MIMIMMIIIMM II lllll Ml Mill II I II Ml I! 

Db 3060 GAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCTGCTATA 3119 

Qy 3151 TTGTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAG 3210 

M IIIIIMIMIIMMIIIIIMIIIIIIIIIIIIIII 

Db 312 0 TTTT CAGCAGAG CTGAGTAAAACTT CAGTTGTTGACCT CCTGTACTGGAGAGACATTAAG 3179 

Qy 3211 AAGACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGC 3270 

II II I II II I II II 1 1 II I II 1 1 1 M I 1 1 i 1 : 1 1 1 1 1 1 1 II MIMI: MIMI 

Db 318 0 AAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTATTCAGC 323 9 

Qy 3271 ATTGTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGG 3330 

lllll II lllll IIIIIMIIIIIIIIIIIIIIIIIII lllll I ! I I 

Db 324 0 ATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGCTTTAGG 32 99 

Qy 3331 ATATATAAGGGCGTGATCCAGGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGGCA 33 90 

Mill Mill 1 1 f I f 1 1 1 IMIIIIIIIIIIMIIIIIIIIIIIIIIIIIMIIIM 

Db 33 00 ATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGGCA 3359 

Qy 33 91 TATTTAGAATCTGAAGTTGCTATATCAGAGGAATTGGTTCAGAAATACAGTAATTCTGCT 34 50 

Ml I MMMMMMIIIIMM Mill IMIMIMM III MM! MM 

Db 3360 TATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAATTCTGCT 3419 

Qy 3451 CTTGGTCATGTGAACAGCACAATAAAAGAACTGAGGCGGCTTTTCTTAGTTGATGATTTA 3 510 

1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 lllll Mill lllll II I I I I I 

Db 3420 CTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGATGATTTA 34 79 

Qy 3511 GTTGATTCCCTGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTC 3570 

MIMIII I I I : I I I M Mill I I I I .1 

Db 34 80 GTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTT 3 539 

Qy 3 571 AATGGTCTGACACTACTGATTTTAGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTAT 3630 

MM I III I IN II |g I Mill || II II MM II II Ml Ml MIMIII 3i65 

Db 3 54 0 AATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTTATTTAT 3 5 99 

Qy 3631 GAACGGCATCAGGTGCAGATAGATCATTATCTAGGACTTGCAAACAAGAGTGTTAAGGAT 3 690 

1 1 M II M I II 1 1 MMMMMMMMIIMMMIMM MM MIMI Ml 

Db 3 6 00 GAACGGCATCAGG CACAGATAGATCATTAT CTAGGACTTG CAAATAAGAATGTTAAAGAT 3 659 

Qy 3691 GCCATGGCCAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCAGA 374 0 

II Mill MMMMMIMMMMMIMMIMMMIMM M 

Db 3 660 G CTATGG CTAAAATCCAAG CAAAAATCC CTGGATTGAAG CG CAAAG CTGA 3 709 



RESULT 3 

US-09-972-599A-5 

Sequence 5, Application US/09972599A 
Patent No. US20020077295A1 
GENERAL INFORMATION: 
APPLICANT : STR I TTMATTER , STEPHEN M . 

TITLE OF INVENTION: NOGO- RECEPTOR -MEDIATED BLOCKADE OF AXONAL GROWTH 
FILE REFERENCE: C077 CIP US 

CURRENT APPLICATION NUMBER: US/09/972 , 599A 
CURRENT FILING DATE: 2001-10-06 
PRIOR APPLICATION NUMBER: PCT/USOl/01041 
PRIOR FILING DATE: 2001-01-12 
PRIOR APPLICATION NUMBER: 09/758,140 
PRIOR FILING DATE: 2001-01-12 
PRIOR APPLICATION NUMBER : 60/236,378 
PRIOR FILING DATE: 2000-09-29 
PRIOR APPLICATION NUMBER: 60/207,366 
PRIOR FILING DATE: 2000-05-26 
PRIOR APPLICATION NUMBER: 60/175,707 
PRIOR FILING DATE: 2000-01-12 
NUMBER OF SEQ ID NOS : 57 
SOFTWARE: Patentln Ver. 2.1 
SEQ ID NO 5 
LENGTH: 4 053 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME/KEY: CDS 
LOCATION: (135) . . (3710) 

OTHER INFORMATION: Human DNA encoding for No. US200200772 95Alo protein 
(KIAA0886, GenBank 

OTHER INFORMATION: Accession No. US20020077295A1 AB020693) 
US-09-972-599A-5 

Query Match 62.6%; Score 2343.6; DB 9; Length 4053; 

Best Local Similarity 81.3%; Pred. No. 0; 

Matches 3017; Conservative 0; Mismatches 574; Indels 119; Gaps 21; 
Qy 134 CACGACTCGGCCTGCCTGGCCCCTGCCAGTCTTGCCCAACCCCCACAACCGCCCGCGACT 193 

I II III I I III llllll Mill I I IIIMI Illlllllllll II 

Db 16 CTCGGCTCAGTCGGCCCAGCCCCTCTCAGTCCTCCCCAACCCCCACAACCGCCCGCGGCT 75 

Qy 194 CTGAGGAGAAGCGGC - CCTGCGGCGGCTGTAGCTGCAGCATCGTCGGCGACCCGCCAGCC 252 

Mill I II I I llllllll I Illlllllllll II I MM MUM 

Db 76 CTGAGACGCGGCCCCGGCGGCGGCGGCAGCAGCTGCAGCATCATC-TCCACCCTCCAGCC 134 

3 ATGGAAGACATAGACCAGTCGTCGCTGGTCTCCTfiGCBfiGfiTGGAGaGCCCGCCCCGGCGr 312 

II 1 1 1 1 1 1 1 I llllllll I Mill MIM IIMIIIIII llllllll 

Db 135 ATGGAAGACCTGGACCAGTCTCCTCTGGT-- -CTCGTCCTCGGACAGCCCACCCCGGCCG 191 

Qy 313 CCGCCCGCCTTCAAGTACCAGTTCGTGACGGAGCCCGAGGACGAGGAGGACGAGGAGGAG 372 

I llllll 1 1 1 1 E 1 1 1 1 1 f 1 1 1 1 1 r 1 1 IMIII III II Ml! M II III 

Db 192 CAGCCCGCGTTCAAGTACCAGTTCGTGAGGGAG CCCGAGGACGAGGAG GAAGAAGAG 24 8 

Qy 373 GAGGAGGACGAGGAGGAGGACGACGAGGAC CTAGAGGAACTGGAGGTG CTGGAGAGGAAG 4 32 

IIIIIMI llllllll II Mill Mill Mill IMIIIIIIIIIIIIIIIMI 

Db 24 9 GAGGAGGAAGAGGAGGACGAGGACGAAGACCTGGAGGAGCTGGAGGTGCTGGAGAGGAAG 308 



Qy 433 CCCGCAGCCGGGCTGTCCGCAGCTGCGGTGC CGCCCGCCGCCGCCGCGCCGCTG 486 

Mill IMMIIIIIIMI II I 1 1 1 1 I II lllllll IMIM III 

Db 3 09 CCCGCCGCCGGGCTGTCCGCGGCCCCAGTGCCCACCGCCCCTGCCGCCGGCGCGCCCCTG 3 68 

Qy 487 CTGGACTTCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGCCGGCCGCGCCC 54 6 

MIMIII I I MM Mill II MIIIMI II II IMMIIMM Ml 

Db 369 ATGGACTTCGGAAATGACTTCGTGCCGCCGGCGCCCCGGGGACCCCTGCCGGCCGCTCCC 42 8 
Qy 54 7 CCTGCCGCTCCTGAGAGGCAGCCATCCTGGGAACGCAGCCCCG CGGCGCCC 5 97 

II I III II III lllllll II II I Mill I I I MM 

Db 42 9 CCCGTCGCCCCGGAGCGGCAGCCGTCTTGGGACCCGAGCCCGGTGTCGTCGACCGTGCCC 488 

Qy 598 GCGCCATCCCTGCCGCCCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAGAGGACGACGAG 657 

1 1 1 1 1 1 : 1 II I I III lllllll hi 'II MM I MM IMIM 

Db 48 9 GCGCCATCCCCGCTGTCTGCTGCCGCAGTCTCGCCCTCCAAGCTCCCTGAGGACGACGAG 54 8 
Qy 658 CCTCCGGCGAGGCCCCCGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGGCGGAG 711 

MIMIII MM II Mill II II Ml III MIIIMI III III 

Db 54 9 CCTCCGGCCCGGCCTCCCCCTCCTCCCCCGGCCAGCGTGAGCCCCCAGGCAGAGCCCGTG 608 
Qy 712 CCCGCCGCGCCCCCTTCCACGCCGGCCGCGCCCAAGCGC 750 

IMIMI III Mill MIIIIMM MMI 

Db 609 TGGACCCCGCCAGCCCCGGCTCCCGCCGCGCCCCCCTCCACCCCGGCCGCGCCCAAGCGC 668 
Qy 751 AGGGGCTCC GGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCT 8 07 

Illllllll Ml MMI Ml MM IIMIMI MIMIII MM III M 

Db 669 AGGGGCTCCTCGGGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCT 72 8 

Qy 808 GTGATAC CCT C CT CTG CAGAAAAAATTATGGATTTGATGGAGCAG CCAGGTAACACTGTT 8 67 

lllllll II MMI IMIM IMIM MM MIIIIMI lllllll II 

Db 72 9 GTGATACGCTCCTCTGCAGAAAA TATGGACTTGAAGGAGCAGCCAGGTAACACTATT 785 

Qy 8 68 TCGTCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCCTCTCTTCCTTCT 927 

Ml MMMMM MMI Ml Illllllll Illllllll III I II MM II III 

Db 78 6 TCGGCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCTTCTCTTCCTTCT 84 5 

Qy 92 8 CTATCTCCTCTCTCAACTGTTTCTTTTAAAGAACATGGATACCTTGGTAACTTATCAGCA 98 7 

II IMMMMMI I I IMIM MMMMM MMMMM II III II 

Db 84 6 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 9 05 

Qy 988 GTGTCATCCTC^GAAGGAACAATTGAAGAAACTTTAAATGAAGCTTCTAAAGAGTTGCCA 1047 

lllllll Illllllll II IMIM I I I MIIIIMI MM I II 

Db 906 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 965 

Qy 104 8 GAGAGGGCAACAAATCCATTTGTAAATAGAGATTTAGCAGAATTTTCAGAATTAGAATAT 1107 

MM Mill II II I I II IllMM MM 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 

Db 966 GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 1025 

Qy 1108 TCAGAAATGGGATCATCTTTTAAAGG CTCC CCAAAAGGAGAGTCAGC CATATTAGTAGAA 1167 

llllllllllll II I I Ml lllllll III II Ml II IMIM I 

Db 1026 TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 1085 
Qy 1168 AACACTAAGGAAGAAGTAATTGTGAGGAGTAAA GACAAAGAGGATTTAGTTTGTAGT 1224 

II III lllllll MM MM I MM II Mill I IMIM III I 

Db 108 6 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 114 5 



Qy 1225 GCAGCCCTTCACAGTCCACAAGAATCACCT GTGGGTAAAGAAGAC 1269 

MINI I II Mill I 1 1 1 1 III III II 

Db 114 6 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1205 

Qy 1270 AGAGTTGTGTCTCCAGAAAAGACAATGGACATTTTTAATGAAATGCAGATGTCAGTAGTA 132 9 

IIIIIIIIII lllllll III I II I MIMIIIIII I I 1 1 1 1 I I 

Db 12 06 GAAGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTGGAA 1265 

Qy 133 0 GCACCTGTGAGGGAAGAGTATGCAGACTTTAAGCCATTTGAACAAGCATGGGAAGTGAAA 138 9 

II III lllllll 1 1 IIIIIIIIIII II MINIM I II II MINI I 

Db 1266 GCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTGAAA 132 5 

Qy 1390 GATACTTATGAGGGAAGTAGGGATGTG CTGG CTG CTAGAG CT AATGTG 1437 

MM I III I III III II MINIM Ml I II II 

Db 1326 GATA GTAAGGAAGATAGTGATATGTTGG CTG CTGGAGGTAAAATCGAGAG CAACTTG 1382 

Qy 1438 GAAAGTAAAGTGGACAGAAAATGCTTGGAAGATAGCCTGGAGCAAAAAAGTCTTGGGAAG 14 97 

NNNNIINN I INNI II I lllllllll III I || I || 

Db 1383 GAAAGTAAAGTGGATAAAAAATGTTTTGCAGATAGCCTTGAGCAAACTAATCACGAAAAA 1442 

Qy 14 98 GATAGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAACCTGTGAAGGAC 1557 

IIIIIIII I II NIN III MINIMUM INNI I I INN 

Db 1443 GATAGTGAGAGTAGTAATGATGATACTTCTTTCCCCAGTACGCCAGAAGGTATAAAGGAT 1502 

Qy 1558 AGCTCCAGAGCATATATTACCTGTGCTTCCTTTA CCTCAGCAACCGAAAGCACCACA 1614 

I II IIIIIIIIII II Illlll Illlll I 1111:1 II Mil II 

Db 15 03 CGTTCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATTGCA 1562 

Qy 1615 GCAAACACTTTCCCTTTGTTAGAAGATCATACTTCAGAAAATAAAACAGATGAAAAAAAA 1674 

Illlll III INI Illlll INN III IIIIIIIIIII I II Mil II I 

Db 1563 ACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAAAAA 1622 

Qy 1675 ATAGAAGAAAGGAAGGCCCAAATTATAACAGAGAAG ACTAGCCCCAAAACGTCAAAT 1731 

IIIIIIIIII Mill III MM IIIIIIIIIII Illlll II Mill IIIII 

Db 1623 ATAGAAGAAAAGAAGG C C CAAATAGTAACAGAGAAGAATA C TAG CA C CAAAA CAT CAAAC 1682 

Qy 1732 CCTTTCCTTGTAGCAGTACAGGATTCTGAGGCAGATTATGTTACAACAGATACCTTATCA 1791 

INN lllllll IIIIIIIIIIIII MIIN lllllll III II 

Db 1683 CCTTTTCTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAACA 1742 

Qy 17 92 AAGGTGACTGAGGCAGCAGTGTCAAACATGCCTGAAGGTCTGACGCCAGATTTAGT 1851 

IIIIIIIIIIIII II III -III Mill MMIIIMM Ml 

Db 1743 AAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTACAG 1802 

Qy 18 52 GAAGCATGTGAAAGTGAACTGAATGAAGCCACAGGTACAAAGATTG 1911 

MMMMMIMIMM lllllllll II lllllllll IIIII IIMM Mill 

Db 18 03 GAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAA CAAAA 1862 

Qy 1912 GTGGACTTGGTCCAAACATCAGAAGCTATACAAGAATCACTTTACrc 1971 

IIIIIIIIII IIIIIIIIIIIII III IIIII IIIII II II IIIIIIIIIII 

Db 18 63 ATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAGCTT 1922 

Qy 1972 TGCCCATCATTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATG 2 031 

MMIMIillMI II I IIIII IIIII lllllllllllllllll lllllllll 

Db 1923 TGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTTATG 1982 



Qy 2 032 GAAGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTA 2 091 



Db 1983 GAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGCTCA 2042 

Qy 2 092 TCCCCACTGGAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCTTGAGCCTGAAAAC 2151 

ii iii i 1 1 1 1 ii ii mill mill ii mil i 1 1 1 1 ! 1 1 1 1 1 1 1 

Db 2 043 TCACCATTAGAAG CTTCTTCAGTTAATTATGAAAGCATAAAACATGAGCCTGAAAAC 209 9 

Qy 2152 CCCCCACCATATGAAGAAGCCATGAATGTAGCACT AAAAGCTTTGGGAACAAAGGAA 2208 

lllllllllllllllll Mlllll MM MM Mill I MM Mlllll 

Db 2100 CCCCCAC(^TATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATCAGGAATAAAGGAA 2159 

Qy 22 09 GGAATAAAAGAG CCTGAAAGTTTTAATG CAG CTGTTCAGGAAACAGAAGCTC CTTATATA 2268 

I III I IMM! I I IIIIMIIMI MM 111111111111111111111 

Db 2160 GAAATTAAAGAG CCTGAAAATATTAATG CAG CTCTT CAAGAAACAGAAGCTC CTTATATA 2219 

Qy s TTGCGTGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGCCAAGTCCAG^f , TSg§E^ 8 

II Mill MUM MMM MM I III II MM Ml III MIMI 

Db 222 0 TCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGATTTC 227 9 

Qy 232 9 TCTAATTATTCAGAAATAGCAAAATTCGAGAAGTCGGTGCCCGAACACGCTGAGCTAGTG 2388 

III ill': llllll ill ill Mill II M : I I I 

Db 228 0 TCTGATTATTCAGAAATGGCA^AAGTTGAACAGCCAGTGCCTGATCATTCTGAGCTAGTT 233 9 

Qy 23 8 9 GAGGATTCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAA 244 8 

II II MIMI I I M MIMI IMM Illllllll I M Mill 

Db 234 0 GAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCTGAC 23 9 9 
Qy 244 9 GTCCCACAAACACAAGAGGAGGCTGTGATGCTCATGAAGGAGAGTCTCACTGA A 2502 

II Mlllll llllll II MIMIIMI MM II IMIIMMM I 

Db 24 00 GTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTC lyp CTS©A,24 59 

Qy 2503 GTGTCTGAGACAGTAGCCCAGCACAAAGAGGAGAGACTTAGTGCCTCACCTCAGGAGCTA 2562 

I I III I I II MM I III Mill I II I III I 

Db 24 60 TTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTTGCCACCTGAGGGA 2519 
Qy 2563 GGAAAGCCATATTTAGAGTCTTTTCAGCCCAATTTACATAGTACAAAAGATGC TGCA 2619 

II M II II I II II I II llllll III II MM Ml IMIMIII I I 

Db 252 0 GGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAAAGATACCCTGTTA 257 9 

Qy 2 620 TCTAATGACATTCCAACATTGACCAAAAAGGAGAAAATTTCTTTGCAAATGGAAGAGTTT 2679 

II MM II MINIMI IIIIMIIIIi III II 1 1 1 1 M I Mill Ml I 

Db 258 0 C CTGATGAAGTTTCAACATTGAG CAAAAAGGAGAAAATTC CTTTG CAGATGGAGGAG CTC 263 9 

Qy 268 0 AATACTGCAATTTATTCAAATGATGACTTACTTTCTTCTAAGGAAGACAAAATAAAAG 273 9 

I Mlllll IMIIIIIIIIIIIMIIII II MMIIIMM I MM MM 

Db 264 0 AGTACTGCAGTTT ATGACTTATTTATTTCTCA&GO&EGCACAGATAAGAGAA 2699 

Qy 274 0 AGTGAAACATTTTCAGATTCATCTCCGATTGAGATAATAGATGAATTTCCCACGTTTGTC 2799 

I llllll MMMMMMIMM IMM || I ! M || || || || 

Db 2700 ACTGAAACGTTTTCAGATTCA.TCTCCAATTGAAATTATAGATGAGTTCCCTACATTGATC 2759 

Qy 28 00 AGTG CTAAAGATGATTC - - - TCCTAAATTAG C CAAGGAGTACACTGATCTAGAAGTAT C C 2856 

Ml Mill llllll I 1 1 1 1 1 1 1 1 1 1 f I III II Mill IMMIMMM 

Db 2760 AGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAAGTATCC 2819 

Qy 28 57 GACAAAAGTGAAATTGCTAATATCCAAAGCGGGGCAGATTCATTGCCTTGCTTAGAATTG 2 916 

IMIIMIIIIIIMIIIII M II II I MMIIIMM I Ml I M 



Db 



282 0 CACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACAGAATTG 2879 



Qy 2 917 CCCTGTGACCTTTCTTTCAAGAATATATATCCTAAAGATGAAG TACATGTTTCA 2 970 

III llllllllllll Mill III I II I I I I Mill I Mill 

Db 2880 CCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGTTTCTCA 2 93 9 

Qy 2 971 GATGAATTCTCCGAAAATAGGTCCAGTGTATCTAAGGCATCCATATCGCCTTCAAATGTC 3 030 

Mill M II Mill MM I III MM I III MM M MM 

Db 2 94 0 GATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCAGATGTT 2 99 9 

Qy 3 031 TCTGCTTTGGAACCTCAGACAGAAATGGGCAGCATAGTTAAATCCAAATCACTTACGAAA 3 09 0 

MMMMI MM MM II I llllllllllll Mill III MM 

Db 3 000 TCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTTGTGAAA 3 05 9 

Qy 3 091 GAAGCAGAGAAAAAACTTCCTTCTGACACAGAGAAAGAGGACAGATCCCTGTCAGCTGTA 3150 

Mill MIMMMIMIMM II Mill MINI Mill I II III II 

Db 3 060 GAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCTGCTATA 3119 

Qy 3151 TTGTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAG 3210 

II MMMMMM llllll IMMIMI MUM Illlllllllllllllll 

Db 3120 TTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGACATTAAG 3179 

Qy 3211 AAGACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGC 327 0 

IIIII.IMIIIIII lllll! II MIM MMI II IIIIIII llllll 

Db 318 0 AAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTATTCAGC 323 9 

Qy 3271 ATTGTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGG 333 0 

lllll II lllll MMMIMMMMMMMIMM lllll IMMMIMM 

Db 324 0 ATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGCTTTAGG 32 9 9 

Qy 3331 ATATATAAGGGCGTGATCCAGG CTAT CCAGAAATCAGATGAAGG C CAC CCATTCAGGG CA 3390 

lllll Mill IIIIMII I IIIIIMMMII! Mill! Illllllll I 

Db 33 00 ATATACAAGGGTGTGAT CCAAGCTAT C CAGAAATCAGATGAAGG CCAC CCATTCAGGG CA 335 9 

Qy 33 91 TATTTAGAATCTGAAGTTG CTATATCAGAGGAATTGGTTCAGAAATACAGTAATTCTG CT 345 0 

III I IMMMMMMMIMM lllll IMMMIMI MMMMMM 

Db 33 60 TATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAATTCTGCT 3419 

Qy 3451 CTTGGTCATGTGAACAGCACAATAAAAGAACTGAGGCGGCTTTTCTTAGTTGATGATTTA 3 510 

. MMMMMM MM lllll lllll MM II 1 1 II II I II II I II 1 1 1 1 

Db 342 0 CTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGATGATTTA 347 9 

Qy 3 511 GTTGATTCCCTGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTC 3 570 

MIIIMI IIIIIIIIIIIIIMIIIMIMIM MM II II 1 1 II 1 1 1 1 M 1 1 1 

Db 3480 GTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTT 3 53 9 

Qy 3 571 AATGGTCTGACAC TAGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTAT 3 63 0 

Mill MMMMI MMI ||| II llllllllllll MMMMMM 

Db 354 0 AATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTTATTTAT 3599 

Qy 3 631 GAACGG CAT CAGGTG CAGATAGAT CATTAT CTAGGACTTG CAAACAAGAGTGTTAAGGAT 3 69 0 

MMMMI 1 1 r 1 1 r i i 1 1 1 e 1 1 1 1 1 f 1 1 r 1 1 1 r 1 1 1 1 mi www m 

Db 3 600 GAACGG CATCAGG CACAGATAGATCATTATCTAGGACTTG CAAATAAGAATGTTAAAGAT 3 65 9 

Qy 36 91 GCCA.TGGCCAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCAGA 374 0 

II lllll MlllllMIIIIII Mill! lllll IIIIIIMIII I II 

Db 3 660 GCTATGGCTAAAATCO^AGCAAAAATCCCTGGATTGAAGCGCAAAGCTGA 3709 



RESULT 4 

US-10-060-036-53 

Sequence 53, Application US/10060036 
Publication No. US20030073144A1 
GENERAL INFORMATION: 
APPLICANT: Benson, Darin R. 
APPLICANT: Kalos, Michael D. 
APPLICANT: Lodes , Michael J. 
APPLICANT: Persing, David H. 
APPLICANT: Hepler, William T. 
APPLICANT: Jiang, Yuqiu 

TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE THERAPY 
TITLE OF INVENTION: AND DIAGNOSIS OF PANCREATIC CANCER 
FILE REFERENCE: 210121.566 

CURRENT APPLICATION NUMBER: US/10/060 , 036 
CURRENT FILING DATE: 2002-01-30 
NUMBER OF SEQ ID NOS : 4560 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 53 
LENGTH: 4632 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-060-036-53 

Query Match 62.6%; Score 2343.6; DB 15; Length 4632; 

Best Local Similarity 81.3%; Pred. No. 0; 

Matches 3 017; Conservative 0; Mismatches 574; Indels 119; Gaps 21; 
Qy 134 CACGACTCGGCCTGCCTGGCCCCTGCCAGTCTTGCCCAACCCCCACAACCGCCCGCGACT 193 

I II III I I III MINI Mill I 1 1 1 1 i : [ 1 1 1 1 1 ; I I ! I I i II 

Db 23 CTCGGCTCAGTCGGCCCAGCCCCTCTCAGTCCTCCCCAACCCCCACAACCGCCCGCGGCT 82 

Qy 194 CTGAGGAGAAGCGGC - CCTGCGGCGGCTGTAGCTGCAGCATCGTCGGCGACCCGCCAGCC 2 52 

Mill I II I I I MM I IMIIIIIIIII M I MM MM 

Db 83 CTGAGACGCGGCCCCGGCGGCGGCGGCAGt^GCTGCAGCATCATC-TCCACCCTCCAGCC 141 

Qy 2 53 ATGGAAGACATAGACCAGTCGTCGCTGGTCTCCTCGTCCACGGACAGCCCGCCCCGGCCT 312 

! 1 1 M 1 1 1 1 I IMIIMI I Mill lllllll MIIMIMI 

Db 14 2 ATGGAAGACCTGGACCAGTCTCCTCTGGT CTCGTCCTCGGACAGCCCACCCCGGCCG 198 

Qy 313 CCGCCCGCCTTCAAGTACCAGTTCGTGACGGAGCCCGAGGACGAGGAGGACGAGGAGGAG 3 72 

I Mill I lllllllllllllllllll I Mill Ml 1 1 Ml II I II II II III 

Db 199 CAGCCCGCGTTCAAGTACCAGTTCGTGAGGGAGCCCGAGGACGAGGAG GAAGAAGAG 2 55 

Qy 373 GAGGAGGACGAGGAGGAGGACGACGAGGACCTAGAGGAACTGGAGGTGCTGGAGAGGAAG 432 

1 1 1 1 1 IMIIMI II Mill Mill Mill IIIMIIIIIMIIIIMIII 

Db 256 GAGGAGGAAGAGGAGGACGAGGACGAAGACCTGGAGGAG CTGGAGGTG CTGGAGAGGAAG 315 
Qy 433 CCCGCAGCCGGGCTGTCCGCAGCTGCGGTGC CGCCCGCCGCCGCCGCGCCGCTG 4 86 

Mill IMIIMIIIIIM II I MM I II lllllll MUM III 

Db 316 CCCGCCGCCGGGCTGTCCGCGGCCCCAGTGCCCACCGCCCCTGCCGCCGGCGCGCCCCTG 375 

Qy 48 7 CTGGACTTCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGCCGGCCGCGCCC 54 6 

IMIIMI I I MM Mill II IMIIMI II II III INN III 

Db 376 ATGGACTTCGGAAATGACTTCGTGCCGCCGGCGCCCCGGGGACCCCTGCCGGCCGCTCCC 435 



Qy 54 7 CCTGCCGCTCCTGAGAGGCAGCCATCCTGGGAACGCAGCCCCG CGGCGCCC 597 

II I III II III MM II Mill I Mill I I I MM 

Db 436 CCCGTCGCCCCGGAGCGGCAGCCGTCTTGGGACCCGAGCCCGGTGTCGTCGACCGTGCCC 4 95 

Qy 598 GCGCCATCCCTGCCGCCCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAGAGGACGACGAG 657 

IMMIMM II I I MIMMIMM lllllllllllllll IMMMIMM 

Db 4 96 GCGCCATCCCCGCTGTCTGCTGCCGCAGTCTCGCCCTCCAAGCTCCCTGAGGACGACGAG 555 
Qy 658 CCTCCGGCGAGGCCCCCGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGGCGGAG 711 

llllllll MM II MIM II II III III llllllll Ml III 

Db 556 CCTCCGGCCCGGCCTCCCCCTCCTCCCCCGGCCAGCGTGAGCCCCCAGGCAGAGCCCGTG 615 

Qy 712 CCCGCCGCGCCCCCTTCCACGCCGGCCGCGCCCAAGCGC 750 

Db promote 16 225XC CAGCCCCGGCTCCCGCCGCGCCCCCCTCCACCCCGGCCGCGCCCAAGCGC 675 

Qy 751 AGGGGCTCC GGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCT 807 

MINIM III MIMI M IIIIIIMIIIIMMMI! IMMMIMM 

Db 676 AGGGGCTCCTCGGGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCT 735 

Qy 808 GTGATACCCTCCTCTGCAGAAAAAATTATGGATTTGATGGAGCAGCCAGGTAACACTGTT 867 

: 1 1 1 1 1 1 lllllllllllllll MIMI MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 

Db 736 GTGATACGCTCCTCTGCAGAAAA TATGGACTTGAAGGAGCAGCCAGGTAACACTATT 792 

Qy 868 TCGTCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGGCTCTCTTCCTTCT 927 

III MM MMM Mill; M IIMM IMIIMM IM MIMMIMM 

Db 793 TCGGCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCTTCTCTTCCTTCT 852 

Qy 928 CTATCTCCTCTCTCAACTGTTTCTTTTAAAGAACATGGATACCTTGGTAACTTATCAGCA 987 

J I Ml I I | | linn MMIMMtHmillMM || IT 

Db 853 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 912 

Qy 988 GTGTCATCCTCAGAAGGAACAATTGAAGAAACTTTAAATGAAGCTTCTAAAGAGTTGCCA 104 7 

II II II I MIMI II MMM I I I IMIIMM Mill I II 

Db 913 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 972 

Qy 104 8 GAGAGGG(^CAAATCCATTT$t4^ ll£700Xr 

MM Mill II II II II MM I M MM 1 1 1 1 1 1 1 1 1 1 1 : 1 I 

Db 973 GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 1032 

Qy 1108 TCA.GAAATGGGATCATCTTTTAAAGG CTC CCCAAAAGGAGAGT CAG CCATATTAGTAGAA 1167 

1 1 M II II 1 1 II II 1 1 1 II I I III MM Ml II Ml M MIMI I 

Db 1033 TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 1092 
Qy 1168 AACACTAAGGAAGAAGTAATTGTGAGGAGTAAA GACAAAGAGGATTTAGTTTGTAGT 1224 

II III lllllll MM MM I MM II Mill I MIMI III I 

Db 1093 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1152 
Qy 1225 GCAGCCCTTCACAGTCCACAAGAATCACCT GTGGGTAAAGAAGAC 1269 

MMM I II MIMI I MM III MIMI II 

Db 1153 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1212 

Qy 127 0 AGAGTTGTGTCTCCAGAAAAGACAATGGACATTTTTAATGAAATGCAGATGTCAGTAGTA 132 9 

1 1 1 1 II M 11 IIIIMI III II 1 1 M I M II I M I I I MM II 

Db 1213 GAAGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTGGAA 1272 



Qy 


1330 


GCACCTGTGAGGGAAGAGTATGCAGACTTTAAGCCATTTGAACAAGCATGGGAAGTGAAA 
ii mi i i i i i i i ii i i i i i i i i i i i ii i i i i i i i i i it iiiiiiiiiiiii 


1389 


Db 


1273 


II III 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II II II 1 1 1 1 1 1 1 II lllllllllllll 
G CTCCTATGAGGGAGGAATATGCAGACTTCAAAC CATTTGAG CGAGTATGGGAAGTGAAA 


1332 


Qy 


1390 


GATACTTATGAGGGAAGTAGGGATGTG CTGG CTG CTAGAG CT AATGTG 
i i i i i i i i i i i i i i i i i i i i i i i i i i i i i ii it 


1437 


Db 


1333 


1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii ii 

GATA GTAAGGAAGATAGTGATATGTTGGCTGCTGGAGGTAAAATCGAGAGCAACTTG 


1389 


Qy 


1438 


GAAAGTAAAGTGGACAGAAAATGCTTGGAAGATAGCCTGGAGCAAAAAAGTCTTGGGAAG 


1497 


Db 


1390 


IIIIIIMIIIIII 1 llllll 1 1 1 Illllllll Illllll 1 II 1 II 

GAAAGTAAAGTGGATAAAAAATGTTTTGCAGATAGCCTTGAGCAAACTAATCACGAAAAA 


1449 


Qy 


1498 


GATAGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAACCTGTGAAGGAC 
i i i i i i i i i ii i i i i i iii i i i i i i i i i i i i i t i i iiiiii i i i i i i i 


1557 


Db 




1 1 1 1 1 1 1 1 1 II 1 1 1 II III 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 llllll | | | | | | | 

TGAGAGTAGTAATGATGATACTTCTTTCCCCAGTACGCCAGAAG$tATAi|4$$4t , 


1509 


Qy 


1558 


AGCTCCAGAGCATATATTACCTGTGCTTCCTTTA CCTCAGCAACCGAAAGCACCACA 

i ii i i i i i i i i i i ii i i i i t i iiiiii i iiiiiii i i i i i i ii 

III 1 M 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1| 1 1 1 || 

CGTTCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATTGCA 


1614 


Db 


1510 


1569 


Qy 


1615 


GCAAACACTTTCCCTTTGTTAGAAGATCATACTTCAGAAAATAAAACAGATGAAAAAAAA 


1674 


Db 


1570 


llllll III II! Mill MMIIMMIMM II IIIIIIMIMI 

ACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAAAAA 


1629 


Qy 


1675 


ATAGAAGAAAGGAAGGCCCAAATTATAACAGAGAAG ACTAGCCCCAAAACGTCAAAT 

I I I I I i i i i i i i i i i i i i i i i i i i i i i i i i i i i iiiiii iiiiiii i i i i i 


1731 


Db 


1630 


1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 llllll IIIIIII Mill 

ATAGAAGAAAAGAAGG C C CAAATAGTAA CAGAGAAGAATAC TAG CA C CAAAA CAT CAAA C 


1689 


Qy 


1732 


CCTTTCCTTGTAGCAGTACAGGATTCTGAGGCAGATTATGTTACAACAGATACCTTATCA 
i i i i i i i i i i i i i i i iiiiiiiiiiiii iiiiiii i i i i i i i i i. i » i i 

1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 £1111 

CCTTTTCTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAACAGATAATTTAACA 


1791 
2448 
1749 


Db 


1690 


Qy 


1792 


AAGGTGACTGAGGCAGCAGTGTCAAACATGCCTGAAGGTCTGACGCCAGATTTAGTTCAG 


1851 


Db 


1750 


lllllllllllll II Ml ML MM MM Mill MIMIMIM III 

AAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTACAG 


1809 


Qy 


1852 


GAAG CATGTGAAAGTGAACTGAATGAAG CCACAGGTACAAAGATTGCTTATGAAACAAAA 


1911 


Db 


1810 


IIIIIII Illllllll II 1 1 1 U II 1 M 

GAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAACAAAA 


1869 


Qy 


1912 


GTGGACTTGGTCCAAACATCAGAAGCTATACAAGAATCACTTTACCCCACAGCACAGCTT 

III lllllllllllll III Mill Mill II II lllllllllll 

ATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAGCTT 


1971 


Db 


1870 


1929 


Qy 


1972 


TGCCCATCATTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATG 
i i i i i i i i i i i i i i ii i i i i i i i i i i i i i i i i i i i i i i i i i i i i itiiiiiii 

1 1 1 1 1 1 1 II M 1 1 1 II 1 1 II 1 1 Mill 1 1 II 1 II II 1 II 1 1 1 1 1 1 II II II 1 1 

TGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTTATG 


2031 


Db 


1930 


1989 


Qy 


2032 


GAAGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTA 

lllllllllll llllll MM II MIMIMIM II 1 llllllll 1 

GAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGCTCA 


2091 


Db 


1990 


2049 


Qy 


2092 


TCCCCACTGGAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCTTGAGCCTGAAAAC 


2151 


Db 


2050 


M Ml 1 MM II II llllll llllll || Mill 1 MMMMIMM 

TCACCATTAGAAG CTTCTTCAGTTAATTATGAAAGCATAAAACATGAGCCTGAAAAC 


2106 



Qy 



2152 CCCCCACCATATGAAGAAGCCATGAATGTAGCACT - - - AAAAGCTTTGGGAACAAAGGAA 22 08 



MINIM Mill lllllll 1 1 1 1 1 1 1 1 Mill I 1 1 1 1 lllllll 

Db 2107 CCCCCACCATATGAAGAGGCCATGAGTGTATCACTAAAAAAAGTATCAGGAATAAAGGAA 2166 

Qy 22 09 GGAATAAAAGAG CCTGAAAGTTTTAATG CAG CTGTTCAGGAAACAGAAG CT C CTTATATA 2268 

I III I t I I I I I i I 1 I I I I lllllllllll I I I I 

Db 2167 GAAATTAAAGAG CCTGAAAATATTAATG CAG CTCTTCAAGAAACAGAAGCT CCTTATATA 222 6 

Qy 22 69 TCCATTGCGTGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGCCAAGTCCAGATTTC 232 8 

II Mill IMIIIIIIIIMIIIIIIIIIIIII II MM III Ml MUM 

Db 222 7 TCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGATTTC 228 6 

Qy 232 9 TCTAATTATTCAGAAATAG CAAAATT CGAGAAGT CGGTGCCCGAACACGCTGAG CTAGTG 238 8 

Ml MIMMMIMI llllll I M Ml Mill II II MIIIIMM 

Db 22 87 TCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGCTAGTT 234 6 

Qy 23 8 

II IIIIMIIIIMM IIIIIIIIIIMMIIIIIIIIIIIIIMIM II Mill 

Db 234 7 GAAGATTCCTCACCTGASTCTGAACCAGTTGACTTATTTAGTGATGATTCfiATSCjKTGAfi 2 3 0 6 
Qy 244 9 GTCCCACAAACACAAGAGGAGGCTGTGATGCTCATGAAGGAGAGTCTCACTGA A 2 502 

II lllllll llllll II MIIIIMM MM II lllllllllll I 

Db 24 07 GTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACTTCA 24 66 

Qy 2 5 03 GTGTCTGAGACAGTAGC CCAG CACAAAGAGGAGAGACTTAGTG CCTCACCT CAGGAG CTA 2562 

I I Ml I I II MM I III Mill I II I III I 

Db 24 67 TTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACT CAGTG CTTTG CCACCTGAGGGA 252 6 
Qy 2563 GGAAAGCCATATTTAGAGTCTTTTCAGCCCAATTTACATAGTACAAAAGATGC TGCA 2619 

MINIMUM II MUM III II MM Ml Nil Ml I 

Db 2527 GGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAA5AGATACCCTGTTA 258 6 

Qy 2620 TCTAATGACATTCCAACATTGACCAAAAAGGAGAAAATTTCTTTGCAAATGGAAGAGTTT 267 9 

II MM II IIMIIMI MMIMMMIMM lllllll Mill III I 

Db 2 587 C CTGATGAAGTTTCAACATTGAG CAAAAAGGAGAAAATTC CTTTG CAGATGGAGGAG CT C 264 6 

Qy 2680 AATACTGCAATTTATTCAAATGATGACTTACTTTCTTCTAAGGAAGAOWVATAAAAGAA 273 9 

I Ml I MMMMMMIMIMM II lllllllllll I MM MM 

Db 264 7 AGTACTG CAGTTTATT CAAATGATGACTTATTTATTT CTAAGGAAGCACAGATAAGAGAA 2706 

Qy 274 0 AGTGAAACATTTTCAGATTCATCTCCGATTGAGATAATAGATGAATTTCCCACGTTTGTC 27 99 

I llllll MMIMMMIIMM Mill II Mi || M II II II 

Db 27 07 ACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACATTGATC 2766 
Qy 28 00 AGTGCTAAAGATGATTC TCCTAAATTAGCCAAGGAGTACACTGATCTAGAAGTATCC 2856 

Ml Mill MIMI I 1 1 1 1 1 1 1 1 1 1 1 Ml II Mill II M 1 1 II M II 

Db 27 67 AGTTCTA^A&CTGXTb CTAAATTAGCCAGGGAATATACTGACCTAGAAGTATCC 2826 

Qy 2 8 57 GACAAAAGTGAAATTGCTAATATCCAAAGCGGGGCAGATTCATTGCCTTGCTTAGAATTG 2 916 

II M II II II II 1 1 II I II I II II II I 1 1 M II II 1 1 II lllllll 

Db 2827 CACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACAGAATTG 288 6 
Qy 2 917 CCCTGTGACCTTTCTTTCAAGAATATATATCCTAAAGATGAAG TACATGTTTCA 2 97 0 

III llllllllllll Mill III I II MM Mill I I I III 

Db 28 8 7 CCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGTTTCT^ 2 94 6 

Qy 2971 GATGAATTCTCCGAAAATAGGTCCAGTGTATCTAAGGCATCCATATCGCCTTCAAATGTC 3 03 0 

Mill II II lllll MM I III MM I III MM II MM 



Db 



2 94 7 GATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCAGATGTT 3 006 



Qy 3 031 TCTGCTTTGGAACCTCAGACAGAAATGGGCAGCATAGTTAAATCCAAATCACTTACGAAA 3 090 

llllllllll I I I I INI II I lilllllllill Mill III I I I I 

Db 3 007 TCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTTGTGAAA 3066 

Qy 3 091 GAAGCAGAGAAAAAAGTTCCTTCTGACACAGAGAAAGAGGACAGATCCCTGTCAGCTGTA 315 0 

Mill IIIIMIIIIIIIIMI II Mill MIMMMIMM I II III II 

Db 3 067 GAAG CTGAGAAAAAACTTC CTTCCGATACAGAAAAAGAGGACAGAT CACCATCTGCTATA 3126 

Qy 3151 TTGTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAG 3210 

II 1 1 1 1 1 I Mill- 1 1 1 ; 1 1 1 i I ! 1 1 1 I 1 1 1 1 1 1 . ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I 

Db 3127 TTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGACATTAAG 3186 

Qy 3211 AAGACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGC 32 7 0 

MMMMMIIMMMMMIMM IMIIIIIIIMI II lllllll llllll 

Db 3187 AAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTATTCAGC 324 6 

Qy 3271 ATTGTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGG 333 0 

Mill II IMM 1 1 1 : 1 1 1 1 1 : 1 1 1 I 1 1 1 ! I : I Mill MIMMIMM 

Db 324 7 ATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGCTTTAGG 33 06 

Qy 3331 ATATATAAGGGCGTGATCCAGGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGGCA 33 9 0 

Mill Mill IIMII MMMMMMMMMIMMMMMIMMIMM 

Db 33 07 ATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGGCA 3366 

Qy 3391 TATTTAGAATCTGAAGTTGCTATATCAGAGGAATTGGTTCAGAAATACAGTAATTCTGCT 34 50 

III I MMIII III IIIMIIII Mill MIMIMIM 111111111:11111 

Db 3367 TATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAATTCTGCT 342 6 

Qy 3451 CTTGGTCATGTGAACAGCACAATAAAAGAACTGAGGCGGCTTTTCO 5 OXA GTSTTtEfi 3510 

MIMMIMIMM MM Mill Mill Mill II 1 1 1 1 1 1 1 1 1 1 1 1 . 1 1 1 1 1 

Db 3427 CTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGATGATTTA 34 8 6 

Qy 3 511 GTTGATTCCCTGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTC 357 0 

1 1 MM 1 1 Ml III IIIIIMIIIIIIIIIII Mill 1 1 1 1 1 M 1 1 1 1 1 i 1 1 M 

Db 34 87 GTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTT 354 6 

Qy 3 571 AATGGTCTGACACTACTGATTTTAGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTAT 3 63 0 

Mill II. Mill; lllll IMM II MIMMIMM I MINI IN 

Db 3 54 7 AATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTTATTTAT 36 06 

Qy 3 631 GAACGG CAT CAGGTGCAGATAGATCATTATCTAGGACTTG CAAACAAGAGTGTTAAGGAT 3690 

1 1 1 1 1 f! 1 1 f 1 1 1 i MM llllll III 

Db 3 607 GAACGG CAT CAGG CACAGATAGATCATTATCTAGGACTTG CAAATAAGAATGTTAAAGAT 3666 

Qy 3 691 GCCATGGCCAAAATC AAATCCCTGGATTGAAGCGCAAAGCAGA 31AAC 477 

M lllll II IMMIIII llllll MINI llllilll M 

Db 3 667 G CTATGGCTAAAATCCAAG CAAAAATCCCTGGATTGAAG CGCAAAG CTGA 3 716 

RESULT 5 
US-09-789-386-1 

Sequence 1, Application US/09789386 
Patent No. US20020010324A1 
GENERAL INFORMATION: 
APPLICANT: MICHALOVICH, DAVID 



APPLICANT: PRINJHA, RABINDER KUMAR 
TITLE OF INVENTION: NOVEL COMPOUNDS 
FILE REFERENCE: GP-30165-C1 

CURRENT APPLICATION NUMBER: US/09/78 9 , 386 
CURRENT FILING DATE: 2001-02-21 
PRIOR APPLICATION NUMBER: U.K. 9916898.1 
PRIOR FILING DATE: 1999-07-19 
PRIOR APPLICATION NUMBER: U.K. 9816024.5 
PRIOR FILING DATE: 1998-07-22 
PRIOR APPLICATION NUMBER: US 09/359,208 
PRIOR FILING DATE: 1999-07-22 
NUMBER OF SEQ ID NOS : 6 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 1 
LENGTH: 357 9 
TYPE: DNA 

ORGANISM: HOMO SAPIENS 
US-09-789-386-1 

Query Match 61.2%; Score 2289.2; DB 9; Length 3579; 

Best Local Similarity 81.5%; Pred. No. 0; 

Matches 2925; Conservative 0; Mismatches 548; Indels 117; Gaps 19; 
Qy 253 ATGGAAGACATAGACCAGTCGTCGCTGGTCTCCTCGTCCACGGACAGCCffGCCCCGGCCflt | 

MINIM! I MINIM I Mill II II I II II II 1 1 1 1 1 1 I II 1 1 1 II 

Db 1 ATGGAAGACCTGGACCAGTCTCCTCTGGT CTCGTCCTCGGACAGCCCACCCCGGCCG 57 

Qy 313 CCGCCCGCCTTCAAGTACCAGTTCGTGACGGAGCCCGAGGACGAGGAGGACGAGGAGGAG 372 

I MINI MINI lllih I MM Mill MM; II II III 

Db 58 CAGCCCGCGTTCAAGTACCAGTTCGTGAGGGAGCCCGAGGACGAGGAG GAAGAAGAG 114 

Qy 373 GAGGAGGACGAGGAGGAGGACGACGAGGACCTAGAGGAACTGGAGGTGCTGGAGAGGAAG 432 

MINIM 1 1 1 II INN INN Mill I II I II II 1 1 1 1 1 II 1 1 1 N I 

Db 115 GAGGAGGAAGAGGAGGACGAGGACGAAGACCTGGAGGAG CTGGAGGTG CTGGAGAGGAAG 174 
Qy 433 CCCGCAGCCGGGCTGTCCGCAGCTGCGGTGC CGCCCGCCGCCGCCGCGCCGCTG 4 86 

INN 1 1 II I II II N I II II I INI I II MINN MINI IN 

Db 175 CCCGCCGCCGGGCTGTCCGCGGCCCCAGTGCCCACCGCCCCTGCCGCCGGCGCGCCCCTG 234 

Qy 48 7 CTGGACTTCAGCAGCGACTCGGTGCCCCCCGCG 

IIIINII II INI INN II MIIINI II II IMIMNIM III 

Db 23 5 ATGGACTTCGGAAATGACTTCGTGCCGCCGGCGCCCCGGGGACCCCTGCCGGCCGCTCCC 2 94 
Qy 547 CCTGCCGCTCCTGAGAGGCAGCCATCCTGGGAACGCAGCCCCG CGGCGCCC 597 

II I III II III lllllll II INN I INN I I I INI 

Db 2 95 CCCGTCGCCCCGGAGCGGCAGCCGTCTTGGGACCCGAGCCCGGTGTCGTCGACCGTGCCC 3 54 

Qy 5 98 GCGCCATCCCTGCCGCCCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAGAGGACGACGAG 657 

1 1 1 1 ! 1 1 1 1 1 II I I MIMMMIN IIIIIIIININII MINIMUM 

Db 355 GCGCCATCCCCGCTGTCTGCTGCCGCAGTCTCGCCCTCCAAGCTCCCTGAGGACGACGAG 414 
Qy 658 CCTCCGGCGAGGCCCCCGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGGCGGAG 711 

MMMN INI II INN II II III III IIIINII III IN 

Db 415 CCTCCGGCCCGGCCTCCCCCTCCTCCCCCGGCCAGCGTGAGCCCCCAGGCAGAGCCCGTG 474 

Qy 712 CCCGCCGCGCCCCCTTCCACGCCGGCCGCGCCCAAGCGC 750 

1111111111111*0*1111 ||$$t$ct:W$WWW 546 



Db 



475 TGGACCCCGCCAGCCCCGGCTCGCGCCGCGCCCCCCTCCACCCCGGCCGCGCCCAAGCGC 534 



Qy 751 AGGGGCTCC GGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCT 8 07 

Illllllll Illlllll III II: MIMIII III Ml III IMIIIM 

Db 53 5 AGGGGCTCCTCGGGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCT 594 

Qy 8 08 GTGATACCCTCCTCTGCAGAAAAAATTATGGATTTGATGGAGCAGCCAGGTAACACTGTT 867 

IMMII MIMMIMM llllll 1 1 1 1 MM IMMII MM II 

Db 5 95 GTGATACGCTCCTCTGCAGAAAA TATGGACTTGAAGGAGCAGCCAGGTAACACTATT 651 

Qy 868 TCGTCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCCTCTCTTCCTTCT 927 

III Ml IIMMMmMIMI III MIMMIMM II 1 1 " 1 1 1 1 1 1 1 

Db 652 TCGGCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCTTCTCTTCCTTCT 711 

Qy 92 8 CTATCTCCTCTCTCAACTGTTTCTTTTAAAGAACATGGATACCTTGGTAACTTATCAGCA 987 

II MIM Ml I I MMM M MMM Mill III I II III II 

Db 712 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 771 

Qy 988 GTGTCATCCTCAGAAGGAACAATTGAAGAAACTTTAAATGAAGCTTCTAAAGAGTTGCCA 104 7 

II I I II I I MM I IMII llllll I I I III Mill III llllll II 

Db 772 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 831 

Qy 104 8 GAGAGGGCAACAAATCCATTTGTAAATAGAGATTTAGCAGAATTTTCAGAATTAGAATAT 1107 

MM Mill II II I I II III II II II 1 1 MM llllll llllll Mill 

Db 832 GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 8 91 

Qy 1108 TCAGAAATGGGATCATCTTTTAAAGGCTCCCCAAAAGGAGAGTCAGCCATATTAGTAGAA 1167 

1 1 1 1 1 1 1 1 1 : ! I 1 1 1 II I I III lllllll Ml M Ml II llllll I 

Db 8 92 TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 951 
Qy 1168 AACACTAAGGAAGAAGTAATTGTGAGGAGTAAA GACAAAGAGGATTTA M I M$t 1224 

II III lllllll MM MM I MM II Mill I llllll Ml I 

Db 952 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1011 
Qy 1225 GCAGCCCTTCACAGTCCACAAGAATCACCT GTGGGTAAAGAAGAC 1269 

llllll I II MMM I MM III MMM II 

Db 1012 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1071 

Qy 12 7 0 AGAGTTGTGTCTCCAGAAAAGACAATGGACATTTTTAATGAAATGCAGATGTCAGTAGTA 132 9 

Mill MM III I Ml MM llllll III III I MM I I 

Db 1072 GAAGTTGTGT CTTCAGAAAAAG CAAAAGACAGTTTTAATGAAAAGAGAGTTG CAGTGGAA 1131 

Qy 133 0 GCACCTGTGAGGGAAGAGTATGGVGACTTTAAGCCATTTGAACAAGCATGGGAAGTGAAA 1389 

II Ml lllllll II M M IIIIMII I M Mi 

Db 1132 GCT CCTATGAGGGAGGAATATG CAGACTTCAAACCATTTGAG CGAGTATGGGAAGTGAAA 1191 
Qy 13 90 GATACTTATGAGGGAAGTAGGGATGTG CTGGCTG CTAGAG CT AATGTG 1437 

MM I Ml I III III II IIIIIIM Ml I M II 

Db 1192 GATA GTAAGGAAGATAGTGATATGTTGG CTG CTGGAGGTAAAATCGAGAG CAACTTG 124 8 

Qy 1438 GAAAGTAAAGTGGACAGAAAATGCTTGGAAGATAGCCTGGAGCAAAAAAGTCTTGGGAAG 1497 

MIMMIMM I MMM II I Illllllll lllllll I II I II 

Db 124 9 GAAAGTAAAGTGGATAAAAAATGTTTTGCAGATAGCCTTGAGCAAACTAATCACGAAAAA 1308 

Qy 14 98 GATAGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAACCTGTGAAGGAC 1557 

IIIIMII I II Mill III IM III MMM MMM I I Mill 

Db 13 09 GATAGTGAGAGTAGTAATGATGATACTTCTTTCC CCAGTACG C CAGAAGGTATAAAGGAT 1368 
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I I llllllllll II MINI MINI I IIIIIII II 1 1 1 1 II 

CGTCCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATTGCA 
GCAAACACTTTCCCTTTGTTAGAAGATCATACTTCAGAAAATAAAACAGATGAAAAAAAA 

llllll III llllllllll Mill i 1 1 1 1 1 1 1 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 1 1 M 

ACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAAAAA 
ATAGAAGAAAGGAAGG CCCAAATTATAACAGAGAAG ACTAGCCCCAAAACGTCAAAT 

llllllllll llllllllllll 1 1 1 1 1 1 1 1 1 M llllll IIIIIII Mill 

ATAGAAGAAAAGAAGGCCCAAATAGTAACAGAGAAGAATACTAGCACCAAAACATCAAAC 
CCTTTCCTTGTAGCAGTACAGGATTCTGAGGCAGATTATGTTACAACAGATACCTTATCA 

Mill llllllllll IIIMIIIIIIII MM MM IIIIMIIII III II 

CCTTTTCTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAAaVGATA^ttt^ 1 ^ 
AAGGTGACTGAGGCAGCAGTGTCAAACATGCCTGAAGGTCTGACGCCAGATTTAGTTCAG 

1 1 1 1 1 1 1 1 II 1 1 1 II III IIIMIIIIIIII! Mill IIIIIIIIIII III 

AAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTACAG 
GAAGCATGTGAAAGTGAACTGAATGAAGCCACAGGTACAAAGATTGCTTATGAAACAAAA 

llllllll I Mill III IIIIIII II I III llllllll llllllll llllll 

GAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAACAAAA 
GTGGACTTGGTCCAAACATCAGAAGCTATACAAGA^ 

llllllllll III IIIIIII III III Mill Mill II M MM llllll I 

ATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAGCTT 
TGCCCATCATTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATG 

,1111 II I Mill Mill II I 1 1 1 1 i 1 1 1 1 1 1 1 

TGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTTATG 

GAAGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTA 
IIIIIIIIIII llllll MM II IIIIIIIIIII II I llllllll I 



II III I MM II II llllll llllll II Mill I MMMMIMM 

TCACCATTAGAAG CTTCTTCAGTTAATTATGAAAGCATAAAACATGAGCCTGAAAAC 1965 

CCCCCACCATATGAAGAAGCCATGAATGTAGCACT AAAAG CTTTGGGAACAAAGGAA 22 08 

MMMMMMIMM IIIIIII MM MM Mill I MM IIIIIII 



I Ml M I IIIIIII III I IIIIIIIIIII MM llllilll Mill IIIIIII 

GAAATTAAAGAGCCTGAAAATATTAATGCAGCTCTTCAAGAAACAGAAGCTCCTTATATA 
TCCATTGCGTGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGCCAAGTCCAGATTTC 

II Mill I III I M I II 1 1 1 1 1 III I II II I II MM III III llllll 

TCTATTGCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGATTTC 
T CTAATTATT CAGAAATAG CAAAATTCGAGAAGTCGGTG CCCGAACACG CTGAG CTAGTG 

Ml M MM MM MMM III Ml Mill II II IIIIMIIII 



Qy 238 9 GAGGATTCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAA 2448 

II llllllllllllll i II Mill 

Db 22 06 GAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCTGAC 2265 
Qy 244 9 GTCCCACAAACACAAGAGGAGGCTGTGATGCTCATGAAGGAGAGTCTCACTGA A 2502 

II MM MINI II IMIMMII Illl II M Ml M I 

Db 2266 GTTCCACAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACTTCA 232 5 

Qy 2503 GTGT CTGAGACAGTAG C CCAGCACAAAGAGGAGAGACTTAGTG CCTCACCTCAGGAGCTA 2562 

I I III I I II Illl I III Mill I II i Ml I 

Db 2326 TTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTTGCCACCTGAGGGA 2385 
Qy 2563 GGAAAGCCATATTTAGAGTCTTTTCAG CC CAATTTACATAGTACAAAAGATG C TGCA 2 619 

1 1 1 1 1 1 1 1 1 1 1 1 M M MUM III II Illl III 1 1 1 1 1 1 f 1 1 I I 

Db 2 3pffi 9GGASSGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAAAGATACCCTGTTA 244 5 

Qy 2 620 TCTAATGACATTCCAACATTGACCAAAAAGGAGAAAATTTCTTTGCAAATGGAAGAGTTT 2679 

II Illl II 1 1 M ! 1 1 1 1 II I II 1 1 1 1 1 II 1 1 1 ! Mill Mill Ml I 

Db 244 6 CCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCAGATGGAGGAGCTC 2 505 

Qy 268 0 AATACTGCAATTTATTCAAATGATGACTTACTTTCTTCTAAGGAAGACAAAATAAAAGAA 273 9 

I MM 1 1 1 ! 1 1 1 1 1 1 ! 1 1 1 1 It 1 1 1 II IMMIMIM I Illl Illl 

Db 2506 AGTACTGCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATAAGAGAA 2 565 

Qy 274 0 AGTGAAACATTTTCAGATTCATCTCCGATTGAGATAATAGATGAATTTCCCACGTTTGTC 2799 

I llllll lllllllllllllllll Mill II I M II II II II 

Db 2566 ACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACATTGATC 262 5 
Qy 28 00 AGTG CTAAAGATGATTC TCCTAAATTAGCCAAGGAGTACACTGATCTAGAAGTATCC 2856 

Ml Mill llllll I llllllllllll III II Mill II II II III III 

Db 2 626 AGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAAGTATCC 2685 

Qy 2 857 GACAAAAGTGAAATTGCTAATATCCAAAGCGGGGCAGATTCATTGCCTTGCTTAGAATTG 2 916 

IMMMIMMIMMIM II II II I llllllllllll Mill 

Db 2686 CACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACAGAATTG 274 5 
Qy 2 917 C CCTGTGACCTTT CTTT CAAGAATATATATCCTAAAGATGAAG TACATGTTTCA 2970 

III IMMMIMM Mill III I M Illl Mill I I MM 

Db 274 6 CCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGTTTCTCA 2805 

Qy 2 971 GATGAATTCTCCGAAAATAGGTCCAGTGTATCTAAGGCATCCATATCGCCTTCAAATGTC 3 03 0 

Mill II II Mill MM I Ml MM I Ml Illl II Illl 

Db 28 06 GATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCAGATGTT 2865 

Qy 3 031 TCTGCTTTGGAACCTCAGACAGAAATGGGCAGCATAGTTAAATCCAAATCACTTACGAAA 3 090 

1 1 1 1 1 1 1 1 1 1 MM Illl II I MIIIMIMM Mill Ml Illl 

Db 2 8 66 T CTG CTTTGG CCACTCAAG CAGAGATAGAGAG CATAGTTAAAC C CAAAGTTCTTGTGAAA 2 92 5 

Qy 3 091 GAAG CAGAGAAAAAACTT C CTTCTGACACAGAGAAAGAGGACAGAT CCCTGTCAG CTGTA 315 0 

Mill MIIIIIIIIMIMM II Mill MMMIMIMM I II III II 

Db 2 92 6 GAAGCTGAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCTGCTATA 2 98 5 

Qy 3151 TTGTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAG 3210 

II M I M 1 1 1 II II I M II II II 1 1 M M I II II II I II I 

Db 2 98 6 TTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGACATTAAG 304 5 

Qy 3211 AAGACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGC 327 0 



Ml ' I Ml 1 1 II I II II I M 1 1 1 1 M I III III II M Mill Mil! 

Db 3 04 6 AAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCCTGCTGCTTTCATTGACAGTATTCAGC 3105 



Qy 3271 ATTGTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGG 3330 

Mill II Mill 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 i I E 1 1 Mill MMMIIMM 

Db 3106 ATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGCTTTAGG 3165 

Qy 3331 ATATATAAGGG CGTGATCCAGG CTATCCAGAAAT CAGATGAAGGC CACCCATTCAGGG CA 33 90 

Mill MM IIIIIMI MMIIMMM IM III IIIMIIMMIIMI 

Db 3166 ATATAC^GGGTGTGATCOVAGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGGCA 322 5 



Qy 3391 TATTTAGAATCTGAAGTTGCTATATCAGAGGAATTGGTTCAGAAATACAGTAATTCTGCT 3450 

III I 1 1 1 1 1 1 : 1 1 ! I M i 1 1 1 1 1 1 Mill lllllllllll II lllllllllllil 

Db 3226 TATCTGGAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAATTCTGCT 32 85 

Qy 3451 CTTGGTCATGTGAACAGCACAATAAAAGAACTGAGGCGGCTTTTCTTAGTTGATGATTTA 3510 

IMMMMIIMM 1 1 1 1 Mill Mill Mill II Ml Ml IIIMIIM 

Db 3286 CTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGATGATTTA 3345 

Qy 3511 GTTGATTCCCTGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTC 3 570 

IIIIIMI IMIIIIIIIIIIIIIIIMIMIII Mill N I : I . 

Db 334 6 GTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTT 34 05 

Qy 3 571 AATGGTCTGACACTACTGATTTTAGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTAT 3 630 

1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 i 1 1 1 M 1 1 Mill II MM I Ml I III i 1 1 1 1 1 i 1 1 1 1 ; 1 1 

Db 34 06 AATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTTATTTAT 34 65 

Qy 3 631 GAACGGCATCAGGTGCAGATAGATCATTATCTAGGACTTGCAAACAAGAGTGTTAAGGAT 3 690 

1 1 1 1 1 1 i i : 1 1 1 1 1 ! 1 1 1! 1 1 1 1 1 i I ! 1 1 1 1 1 M I II 1 1 1 1 1 1 1 1 Ml 

Db 3466 GAACGG^TCAGGCGCAGATAGATC^TTATCTAGGACTTGOVAATAAGAATGTTAAAGAT 3525 

Qy 3691 GCCATGGCCAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCAGA 374 0 

II Mill M 1 1 II 1 1 ! II MM 1 1 II II 1 1 1 Ml III 1 1 M II 

Db 3526 G CTATGG CTAAAAT C CAAGCAAAAAT CC CTGGATTGAAG CG CAAAG CTGA 3575 
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Qy 253 ATGGAAGACATAGACCAGTCGTCGCTGGTCTCCTCGTCCACGGACAGCCCGCCCCGGCCT 312 

MINIMI I Ml Ml I Mill MMIM Mill. I MUM 

Db 1 ATGGAAGACCTGGACCAGTCTCCTCTGGT CTCGTCCTCGGACAGCCCACCCCGGCCG 57 

Qy 313 CCGCCCGCCTTCAAGTACCAGTTCGTGACGGAGCCCGAGGACGAGGAGGACGAGGAGGAG 372 

I MUM MM II MUM I 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II II III 

Db 58 CAGCCCGCGTTCAAGTACCAGTTCGTGAGGGAGCCCGAGGACGAGGAG - - -GAAGAAGAG 114 

Qy 3 73 GAGGAGGACGAGGAGGAGGACGACGAGGACCTAGAGGAACTGGAGGTGCTGGAGAGGAAG 432 

Mill: Mill II Mill Mill Mill IIIIMM llllll I 

Db 115 GAGGAGGAAGAGGAGGACGAGGACGAAGACCTGGAGGAGCTGGAGGTGCTGGAGAGGAAG 174 
Qy 433 CCCGCAGCCGGGCTGTCCGCAGCTGCGGTGC CGCCCGCCGCCGCCGCGCCGCTG 4 86 

IMM I MMM II II I MM I II III! MM III 

Db 17 5 CCCGCCGCCGGGCTGTCCGCGGCCCCAGTGCCCACCGCCCCTGCCGCCGGCGCGCCCCTG 234 

Qy 4 87 CTGGACTTCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGCCGGCCGCGCCC 54 6 

IIIIMM I I 1 1 1 1 IMM M IIIIMM II II Ml MMM III . 

Db 235 ATGGACTTCGGAAATGACTTCGTGCCGCCGGCGCCCCGGGGACCCCTGCCGGCCGCTCCC 294 
Qy 54 7 CCTGCCGCTCCTGAGAGGCAGCCATCCTGGGAACGCAGCCCCG CGGCGCCC 5 97 

II I III M III MMIM II Mill I Mill I I I MM 

Db 2 95 CCCGTCGCCCCGGAGCGGCAGCCGTCTTGGGACCCGAGCCCGGTGTCGTCGACCGTGCCC 354 

Qy 598 GCGCCATCCCTGCCGCCCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAGAGGACGACGAG 657 

llllllllll II I I MIMMIMM MMMMMIMM IMMMMIM 

Db 355 GCGCCATCCCCGCTGTCTGCTGCCGCAGTCTCGCCCTCCAAGCTCCCTGAGGACGACGAG 414 

Qy 658 CCTCCGGCGAGGCCCCCGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGGCGGAG 711 

Db 415 CCTCCGGCCCGGCCTCCCCCTCCTCCCCCGGCCAGCGTGAGCCCCCAGGCAGAGCCCGTG 474 

Qy 712 CCCGCCGCGCCCCCTTCCACGCCGGCCGCGCCCAAGCGC 750 

I ] 1 1 1 E 1 1 1 1 1 1 J 1 Mill MMM llllll M: 

Db 4 75 TGGACCCCGCCAGCCCCGGCTCCCGCCGCGCCCCCCTCCACCCCGGCCGCGCCCAAGCGC 534 
Qy 751 AGGGGCTCC GGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCT 8 07 

IIIIMM II II MMM MMMMIMM IIIIIMM llllll II 

Db 535 AGGGGCTCCTCGGGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTGCTGCATCTGAGCCT 594 



Qy 808 GTGATACCCTCCTCTGCAGAAAAAATTATGGATTTGATGGAGC71GCCAGGTAACACTGTT 8 67 

lllllll MM MINI I MINI MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I II 

Db 595 GTGATACGCTCCTCTGCAGAAAA TATGGACTTGAAGGAG CAG CCAGGTAACACTATT 651 

Qy 8 68 TCGTCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCCTCTCTTCCTTCT 927 

III MMMM llllll MMM MIMI II IMMMII Mill MM 

Db 652 TCGGCTGGTCAAGAGGATTTCCCATCTGTCCTGCTTGAAACTGCTGCTTCTCTTCCTTCT 711 

Qy 92 8 CTATCTCCTCTCTCAACTGTTTCTTTTAAAGAACATGGATACCTTGGTAACTTATCAGCA 98 7 

II MMMM I I llllll MMI M MMI M M II III II 

Db 712 CTGTCTCCTCTCTCAGCCGCTTCTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACA 771 

Qy 988 GTGTCATCCTCAGAAGGAACAATTGAAGAAACTTTAAATGAAGCTTCTAAAGAGTTGCCA 104 7 

lllllll IMIIIIII II llllll I I I MIMI MMMM I II 

Db 772 GTATTACCCACTGAAGGAACACTTCAAGAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCA 831 

Qy 1048 GAGAGGG CAACAAATC CATTTGTAAATAGAGATTTAG CAGAATTTTCAGAATTAGAATAT 1107 

1 1 1 1 Mill llllll II MM MMI MM I M MMM Ml 

Db 832 GAGAAGGCAAAAACTCTACTCATAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATAC 8 91 

Qy 1108 TCAGAAATGGGAT CAT CTTTTAAAGG CTCCC CAAAAGGAGAGTCAGCCATATTAGTAGAA 1167 

Mill MMMM II I I Ml lllllll III II III II llllll I 

Db 8 92 TCAGAAATGGGATCATCGTTCAGTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCA 951 
Qy 1168 AACACTAAGGAAGAAGTAATTGTGAGGAGTAAA GACAAAGAGGATTTAGTTTGTAGT 1224 

II III lllllll MM MM I MM M Mill I MIMI III I 

Db 952 AATCCTAGGGAAGAAATAATCGTGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAAT 1011 
Qy 1225 GCAGCCCTTCACAGTCCACAAGAATCACCT GTGGGTAAAGAAGAC 1269 

llllll I II MMM I MM Ml MMM II 

Db 1012 AACATCCTTCATAATCAACAAGAGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGAT 1071 

Qy 127 0 AGAGTTGTGT CT CCAGAAAAGACAATGGACATTTTTAATGAAATG CAGATGTCAGTAGTA 132 9 

III MM lllllll III 1 1 II 1 1 II 1 1 II 1 1 1 I I MM I I 

Db 1072 GAAGTTGTGTCTTCAGAAAAAGCAAAAGACAGTTTTAATGAAAAGAGAGTTGCAGTGGAA 1131 

Qy 133 0 GCACCTGTGAGGGAAGAGTATGCAGACTTTAAGCCATTTGAACAAGCATGGGAAGTGAAA 138 9 

II III lllllll II MMIMIMI II MMMM I II 

Db 1132 GCTCCTATGAGGGAGGAATATGCAGACTTCAAACCATTTGAGCGAGTATGGGAAGTGAAA 1191 
Qy 13 90 GATACTTATGAGGGAAGTAGGGATGTG CTGG CTG CTAGAG CT AATGTG 1437 

MM Mill Ml III II MMMM III I II II 

Db 1192 GATA GTAAGGAAGATAGTGATATGTTGG CTG CTGGAGGTAAAAT CGAGAG CAACTTG 124 8 

Qy 14 38 GAAAGTAAAGTGGACAGAAAATG CTTGGAAGATAG CCTGGAG CAAAAAAGTCTTGGGAAG 14 97 

MMI IMMM I MIMI II I Mill MMM I II I II 

Db 124 9 GAAAGTAAAGTGGATAAAAAATGTTTTG CAGATAG CCTTGAG CAAACTAATCACGAAAAA 13 08 

Qy 14 98 GATAGTGAAGGCAGAAATGAGGATGCTTCTTTCCCCAGTACCCCAGAACCTGTGAAGGAC 1557 

MMMM I II Mill III IIIIMIIMIIIMI llllll I I Mill 

Db 13 09 GATAGTGAGAGTAGTAATGATGATACTT CTTTCC CCAGTACG CCAGAAGGTATAAAGGAT 1368 
Qy 1558 AGCTCCAGAGCATATATTACCTGTGCTTCCTTTACCT CAGCAACCGAAAGCACCACA 1614 

i i Milium ii Mini mini i iiiiiii ii mm m 

Db 1369 CGTCCAGGAGCATATATCACATGTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATTG^ 1428 



Qy 1615 GCAAAC^CTTTCCCTTTGTTAGAAGATCATACTTCAGAAAATAAAACAGATGAAAAAA^ 1674 

III III Illlllllll Mill ,11 lllllll II IIIIIMIIIII 

Db 142 9 ACAAACATTTTTCCTTTGTTAGGAGATCCTACTTCAGAAAATAAGACCGATGAAAAAAAA 14 88 
Qy 1675 ATAGAAGAAAGGAAGGCCCAAATTATAACAGAGAAG ACTAGCCCCAAAACGTCAAAT 1731 

Illlllllll llllllllllll I; III I MUM Mill: Mill 

Db 14 89 ATAGAAGAAAAGAAGG C C CAAATAG TAA CAGAGAAGAATA CTAG CAC CAAAACAT CAAA C 1548 

Qy 1732 CCTTTCCTTGTAGCAGTACAGGATTCTGAGGCAGATTATGTTACAACAGATACCTTATCA 17 91 

Mill IIMMIMI I MUM M MINIM IMIMIMI III II 

Db 154 9 CCTTTTCTTGTAGCAGCACAGGATTCTGAGACAGATTATGTCACAACAGATAATTTAACA 16 08 

Qy 17 92 AAGGTGACTGAGGCAGCAGTGTCAAACATGCCTGAAGGTCTGACGCCAGATTTAGTTCAG 1851 

IIIIIMM II III IIMIIIIIIIMIII Mill 1 1 1 ! 1 1 1 ; 1 1 1 III 

Db 1609 AAGGTGACTGAGGAAGTCGTGGCAAACATGCCTGAAGGCCTGACTCCAGATTTAGTACAG 1668 

Qy 1852 GAAGCATGTGAAAGTGAACTGAATGAAGCCACAGGTACAAAGATTGCTTATGAAACAAAA 1911 

I IMIMIII MM: MIIIIMI II 1 1 M Ml M Ml I II Ml II 1 1 II I 

Db 1669 GAAGCATGTGAAAGTGAATTGAATGAAGTTACTGGTACAAAGATTGCTTATGAAACAAAA 1728 

Qy 1912 GTGGACTTGGTCCAAACATCAGAAGCTATACAAGAATCACTTTACCCCACAGCACAGCTT 1971 

Illlllllll II I II Ml Mill III Mill Mill M II MUM Mill 

Db 172 9 ATGGACTTGGTTCAAACATCAGAAGTTATGCAAGAGTCACTCTATCCTGCAGCACAGCTT 17 8 8 

Qy 1972 TGCCCATCATTTGAGGAAGCTGAAGCAACTCCGTCACCAGTTTTGCCTGATATTGTTATG 2031 

MMMMMIMI II i Mill Mill IIIMIMIIIIIIMI IIIIIMM 

Db 178 9 TGCCCATCATTTGAAGAGTCAGAAGCTACTCCTTCACCAGTTTTGCCTGACATTGTTATG 184 8 

Qy 2 032 GAAGCACCATTAAATTCTCTCCTTCCAAGCGCTGGTGCTTCTGTAGTGCAGCCCAGTGTA 2 091 

IIIIIMM 1 1 1 1 1 1 MM II MIMIMIM M I Mill! I 

Db 184 9 GAAGCACCATTGAATTCTGCAGTTCCTAGTGCTGGTGCTTCCGTGATACAGCCCAGCTCA 1908 

Qy 2 092 TCCCCACTGGAAGCACCTCCTCCAGTTAGTTATGACAGTATAAAGCTTGAGCCTGAAAAC 2151 

II Ml I MM II II MUM llllll II Mill I 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1909 TCACCATTAGAAG CTTCTTCAGTTAATTATGAAAGCATAAAACATGAGCCTGAAAAC 1965 

Qy 2152 CCCCCACCATATGAAGAAGCCATGAATGTAGCACT AAAAGCTTTGGGAACAAAGGAA 22 08 

I I I f 1 1 f 1 1 1 1 1 1 1 [ I f lllllll MM MM Mill I MM lllllll 

Db 1966 CCCCCAC CATATGAAGAGG C CATGAGTGTAT CACTAAAAAAAGTATCAGGAATAAAGGAA 2 025 

Qy 22 09 GGAATAAAAGAGCCTGAAAGTTTTAATGCAGCTGTTCAGGAAACAGAAGCTCCTTATATA 2268 

I III MIMIMIIMI I lllllllllll MM MIMIIMIMMMIMM 

Db 2 026 GAAATTAAAGAGCCTGAAAATATTAATG CAG CT CTTCAAGAAACAGAAG CTCCTTATATA 2 085 

Qy 2269 TCCATTGCGTGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGCCAAGTCCAGATTTC 2328 

II Mill 1 1 1 1 M 1 1 i I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II MM Ml III llllll 

Db 2 086 TCTATTG CATGTGATTTAATTAAAGAAACAAAG CTTTCTG CTGAAC CAG CT CCGGATTTC 214 5 

Qy 232 9 TCTAATTATTCAGAAATAGCAAAATTCGAGAAGTCGGTGCCCGAACACGCTGAGCTAGTG 2388 

Ml MMIIIMM llllll III II I Mill II II IMIMIMI 

Db 2146 TCTGATTATTCAGAAATGGCAAAAGTTGAACAGCCAGTGCCTGATCATTCTGAGCTAGTT 22 05 

Qy 23 89 GAGGATTCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAA 2448 

II I I I II I . ,1 II I I , ! ! I II, || Mill 

Db 22 06 GAAGATTCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCTGAC 2265 



Qy 



2449 GTCCCACAAACACAAGAGGAGGCTGTGATGCTCATGAAGGAGAGTCTCACTGA A 2502 



ii iiiiiii iiiiii ii Milium mi ii iiiiiiiiiii i 

Db 2266 GTTCC^CAAAAACAAGATGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACTTCA 2325 

Qy 2 5 03 GTGTCTGAGACAGTAGCCCAGCACAAAGAGGAGAGACTTAGTGCCTCACCTCAGGAGCTA 2562 

> I I I I I I II I I I I I III INN I II I III I 

Db 2326 TTTGAGTCAATGATAGAATATGAAAATAAGGAAAAACTCAGTGCTTTGCCACCTGAGGGA 2385 

Qy 2563 GGAAAGCCATATTTAGAGTCTTTT CAG CC CAATTTACATAGTACAAAAGATG C TGCA 2 619 

llilllllllllll II Illlll Ml I! MM III 1 1 1 1 [ 1 1 i I I I 

Db 23 8 6 GGAAAGCCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAAAGATACCCTGTTA 24 45 

Qy 2 62 0 TCTAATGACA.TTCCAACATTGACCAAAAAGGAGAAAATTTCTTTGCAAATGGAAGAGTTT 2679 

II MM II 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 MIMM Mill Ml | 

Db 244 6 CCTGATGAAGTTTCAACATTGAGCAAAAAGGAGAAAATTCCTTTGCAGATGGAGGAGCTC 2505 

Qy 2 68 0 AATACTGCAATTTATTCAAATGATGACTTACTTTCTTCTAAGGAAGACAAAATAAAAGAA 2739 

I lllllll MM MM Mill III II lllllllllll I MM MM 

Db 25 06 AGTACTG CAGTTTATTCAAATGATGACTTATTTATTT CTAAGGAAG CACAGATAAGAGAA 2565 

Qy 274 0 AGTGAAACATTTTCAGATTCATCTCCGATTGAGATAATAGATGAATTTCCCACGTTTGTC 279 9 

I Illlll hi I I II h Mill II Ml || || || II II 

Db 2566 ACTGAAACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACATTGATC 2 625 
Qy 28 00 AGTG CTAAAGATGATTC TCCTAAATTAGCCAAGGAGTACACTGATCTAGAAGTATCC 28 56 

Ml Mill Illlll I MIMMIMM III M Mill 

Db 2626 AGTTCTAAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAAGTATCC 268 5 

Qy 2857 GACAAAAGTGAAATTGCTAATATCCAAAGCGGGGCAGATTCATTGCCTTGCTTAGAATTG 2 916 

llllllllllllllllllll II II II I 1:1 MIMM 

Db 2686 CACAAAAGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACAGAATTG 2745 
Qy 2 917 CCCTGTGACCTTTCTTTCAAGAATATATATCCTAAAGATGAAG TACATGTTTCA 2970 

Ml MIMMIMM Mill III I II MM Mill I I I Ml 

Db 2746 CCCCATGACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGTTTCTCA 2 805 

Qy 2971 GATGAATTCTCCGAAAATAGGTCCAGTGTATCTAAGGCATCCATATCGCCTTCAAATGTC 3 03 0 

Mill II II Mill MM I III MM I Ml MM II MM 

Db 2806 GATGACTTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCAGATGTT 2865 

Qy 3 031 TCTGCTTTGGAACCTCAGACAGAAATGGGCAGCATAGTTAAATCCAAATCACTTACGAAA 3090 

IIMIIMM MM MM II I IMIMMIMI Mill III MM 

Db 28 66 TCTGCTTTGGCCACTCAAGCAGAGATAGAGAGCATAGTTAAACCCAAAGTTCTTGTGAAA 2 925 

Qy 3 091 GAAGCAGAGAAAAAACTTCCTTCTGACACAGAGAAAGAGGACAGATCCCTGTCAGCTGTA 3150 

MM! 1 1 1 1 M 1 1 1 1 1 ! 1 1 1 1 1 II 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 I MIMM 

Db 2 926 GAAG CTGAGAAAAAACTTC CTTC CGATACAGAAAAAGAGGACAGATCACCATCTG CTATA 2 98 5 

Qy 3151 TTGT CAGCAGAG CTGAGTAAAACTT CAGTTGTTGAC CT CCTCTACTGGAGAGACATTAAG 3210 

M IIIIIIIMIIIIIIIIMIIIIIIIIIIIIIMIIII IIIIIIIIIIIIMIIII 

Db 2 986 TTTTCAGCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGACATTAAG 3 04 5 

QY 3211 AAGACTGGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGC 3270 

mmiimiiiiiiiiiiiiiiiiiii MMimimi m mmi mm 

Db 3 04 6 AAGACTGGAGTGGTGTTTGGTGCCAGCCTATTCGTGCTGCTTTCATTGACAGTATTCAGC 3105 

Qy 3271 ATTGTCAGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGG 3330 

Mill II Mill IMIIIIIIIIMIIIIMIIIIIII Mill IMIMMIMI 



Db 



3106 



ATTGTGAGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGCTTTAGG 



3165 



Qy 3331 ATATATAAGGGCGTGATCCAGGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGGCA 33 90 

Mill Mill llllllll IMIIIIIMIIIIIIIIMMI III Ml III I 

Db 3166 ATATACAAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGGCA 322 5 

Qy 3391 TATTTAGAATCTGAAGTTG CTATAT CAGAGGAATTGGTT CAGAAATACAGTAATTCTG CT 34 50 

Ml I I Ml 1 1 Ml Ml 1 1 1 1 1 Mill IMIIMMM MMM III M 

Db 322 6 TATCTGGAAT CTGAAGTTG CTATAT CTGAGGAGTTGGTT CAGAAGTACAGTAATTCTGCT 32 8 5 

Qy 3451 CTTGGTCATGTGAACAGCACAATAAAAGAACTGAGGCGGCTTTTCTTAGTTGATGATTTA 3510 

M III IMMII Ml! IMM Mill Mill II 1 1 1 1 1 1 1 1 1 1 1 1 II I II I 

Db 328 6 CTTGGTCATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGATGATTTA 334 5 

Qy 3 511 GTTGATTCCCTGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTC 3570 

Mill M II Ml MM 1 1 M Ml Ml 1 1 1 Mill 1 1 1 1 1 M 1 1 1 1 E I M 1 1 

Db 334 6 GTTGATTCTCTGAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTT 34 05 

Qy 3 571 AATGGTCTGACACTACTGATTTTAGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTAT 363 0 

I IMMM MMI MMI Mill II MIMMIMM MMMIMIMM 

Db 34 06 AATGGTCTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTTATTTAT 34 65 

Qy 3631 GAACGGCATCAGGTGCAGATAGATCATTATCTAGGACTTGCAAACAAGAGTGTTAAGGAT 3690 

lllllllllllll MMM Ml II Ml MMM ilMIIIIII MM MMM III 

Db 34 66 GAACGGCATCAGGCGCAGATAGATCATTATCTAGGACTTGCAAATAAGAATGTTAAAGAT 3525 

Qy 3691 GCCATGGCCAAAATCCAAGCAAAAATCCCTGGATTGAAGCGCAAAGCAGA 374 0 

II Mill MMIMIMI MMM IMMM III IM. ; II 

Db 3526 G CTATGG CTAAAAT CCAAG CAAAAAT CCCTGGATTGAAG CG CAAAG CTGA 3575 



RESULT 7 

US-10-220-891-22 

Sequence 22, Application US/10220891 
Publication No. US20030207286A1 
GENERAL INFORMATION: 
APPLICANT: NAKAGAWARA, AKIRA 

TITLE OF INVENTION: NUCLEIC ACID SEQUENCES HAVING CHARACTERITICS OF ENHANCED 
TITLE OF INVENTION: EXPRESSION IN HUMAN NEUROBLASTOMA WITH FAVORABLE 
PROGNOSIS 

; TITLE OF INVENTION: BASED ON COMPARISON BETWEEN HUMAN NEUROBLASTOMA WITH 
FAVORABLE 

; TITLE OF INVENTION: PROGNOSIS AND HUMAN NEUROBLASTOMA WITH UNFAVORABLE 
PROGNOSIS 

FILE REFERENCE: 7388-73435 

CURRENT APPLICATION NUMBER : US/ 10/22 0 , 8 9 1 
CURRENT FILING DATE: 2003-03-07 
PRIOR APPLICATION NUMBER: JP 2000/140387 
PRIOR FILING DATE: 2000-05-12 
PRIOR APPLICATION NUMBER: JP 2000/159195 
PRIOR FILING DATE : 2000-03-07 
NUMBER OF SEQ ID NOS : 108 
SOFTWARE: Patent In version 3.2 
SEQ ID NO 22 
LENGTH: 1980 
TYPE: DNA 

ORGANISM: Homo sapiens 



US-10-220-891-22 



Query Match 29.1%; Score 1088.8; DB 13; Length 1980; 

Best Local Similarity 83.5%; Pred. No. 1.3e-269; 

Matches 128 9; Conservative 0; Mismatches 237; Indels 18; Gaps 4; 
Qy 2215 AAAGAGCCTGAAAGTTTTAATGCAGCTGTTCAGGAAACAGAAGCTCCTTATATATCCATT 2274 

IIIIIIIII I Illllllllil 1 1 1 1 II MIIIMI llllll M Ml 

Db 28 AAAGAGCCTGAAAATATTAATGCAGCTCTTCAAGAAACAGAAGCTCCTTATATATCTATT 87 

Qy 2275 GCGTGTGATTTAATTAAAGAAACAAAGCTCTCCACTGAGCCAAGTCCAGATTTCTCTAAT 2334 

II MMMIMIIMM llllll I || I I I I Ml Ml IIIIIIIII II 

Db 88 GCATGTGATTTAATTAAAGAAACAAAGCTTTCTGCTGAACCAGCTCCGGATTTCTCTGAT 147 

Qy 2335 TATTCAGAAATAG CAAAATTCGAGAAGTCGGTG C CCGAACACG CTGAG CTAGTGGAGGAT 2394 

Illllllllil llllll I II II I Mill 1 1 II MIIIIMM II III 

Db 14 8 TATTCAGAAATGG CAAAAGTTGAACAG CCAGTG CCTGATCATT CTGAG CTAGTTGAAGAT 207 

Qy 2395 TCCTCACCTGAATCTGAACCAGTTGACTTATTTAGTGATGATTCGATTCCTGAAGTCCCA 2454 

II llllll: Mlllll llllll M llllll llllll, II Mill II III 

Db 2 08 TCCTCACCTGATTCTGAACCAGTTGACTTATTTAGTGATGATTCAATACCTGACGTTCCA 267 
Qy 2455 CAAACACAAGAGGAGG CTGTGATGCTCATGAAGGAGAGTCT CACTGA AGTGTCT 25 08 

MM Mill II llllllllll MM II IIMMMMI I I 

Db 268 CAAAAACAAGGTGAAACTGTGATGCTTGTGAAAGAAAGTCTCACTGAGACTTCATTTGAG 32 7 

Qy 2509 GAGACAGTAG CC CAG CACAAAGAGGAGAGACTTAGTG CCTCAC CTCAGGAG CTAGGAAAG 2568 

llllll II MM I III Mill I II I III Mlllll 

Db 328 TCAATGATAGAATATGAAAATAAGGAAAAACT CAGTG CTTTG C CAC CTGAGGGAGGAAAG 387 
Qy 2569 CCATATTTAGAGTCTTTTCAGCCCAATTTACATAGTACAAAAGATGC TGCATCTAAT 2 625 

MIIIMI II llllll III II MM III IIIIIIIII I Mill 

Db 388 CCATATTTGGAATCTTTTAAGCTCAGTTTAGATAACACAAAAGATACCCTGTTACCTGAT 447 

Qy 2 626 GACATTCCAACATTGACCAAAAAGGAGAAAATTTCTTTGCAAATGGAAGAGTTTAATACT 2685 

II II IIIIIIIII IMMMIMIIMM Mlllll Mill III I I MM 

Db 448 GAAGTTTCAACATTGAG CAAAAAGGAGAAAATT C CTTTG CAGATGGAGGAGCTCAGTACT 507 

Qy 2686 GCAATTTATTCAAATGATGACTTACTTTCTTCTAAGGAAGACAAAATAAAAGAAAGTG^ 274 5 

III Mlllll llllll MM II MIIIMI I MM Mill MM 

Db 508 GCAGTTTATTCAAATGATGACTTATTTATTTCTAAGGAAGCACAGATAAGAGAAACTGAA 567 

Qy 274 6 ACATTTTCAGATTCATCTCCGATTGAGATAATAGATGAATTTCCCACGTTTGTCAGTGCT 28 05 

II M 1 1 1 1 M I II 1 1 1 1 Mill II MIIIMI II II II II Mill II 

Db 568 ACGTTTTCAGATTCATCTCCAATTGAAATTATAGATGAGTTCCCTACATTGATCAGTCCT 627 
Qy 28 06 AAAGATGATTC TCCTAAATTAGCCAAGGAGTACACTGATCTAGAAGTATCCGACAAA 2862 

Ml llllll I Ml MUM Ml M Mill l MIIIMI Mill 

Db 628 AAAACTGATTCATTTTCTAAATTAGCCAGGGAATATACTGACCTAGAAGTATCCCACAAA 687 

Qy 2 863 AGTGAAATTGCTAATATCCAAAGCGGGGCAGATTCATTGCCTTGCTTAGAATTGCCCTGT 2 922 

iiiiiiiiiiiiiii ii ii ii i 1 1 1 ; 1 1 1 1 1 1 1 1 iiiiiiiiii i 

Db 688 AGTGAAATTGCTAATGCCCCGGATGGAGCTGGGTCATTGCCTTGCACAGAATTGCCCCAT 747 
Qy 2 923 GACCTTTCTTTCAAGAATATATATCCTAAAGATGAAG TACATGTTTCAGATGAA 2976 

1 1 1 M 1 1 1 1 1 Mill Ml I M MM Mill I I I IIIIIII 

Db 748 GACCTTTCTTTGAAGAACATACAACCCAAAGTTGAAGAGAAAATCAGTTTCTCAGATGAC 8 07 



Qy 2 977 TTCTCCGAAAATAGGTCCAGTGTATCTAAGGCATCCATATCGCCTTCAAATGTCTCTGCT 3 03 6 

II II Mill Mil I III MM I III MM II MM llllll 

Db 808 TTTTCTAAAAATGGGTCTGCTACATCAAAGGTGCTCTTATTGCCTCCAGATGTTTCTGCT 867 

Qy 3037 TTGGAACCTCAGACAGAAATGGGCAGCATAGTTAAATCCAAATC^ 3 096 

1 1 1 1 Mill 1 1 1 1 II I llllllllllll Mill Ml i 1 1 1 1 1 1 1 1 

Db 868 TTGGCCACTCAGG CAGAGATAGAGAG CATAGTTAAACC CAAAGTTCTTGTGAAAGAAGCT 927 

Qy 3 097 GAGAAAAAACTTCCTTCTGACACAGAGAAAGAGGACAGATCCCTGTCAGCTGTATTGTCA 3156 

IIIIIIIIMIIIIMI 1 1 Mill 1 1 1 1 1 1 1 1 1 1 i 1 1 1 I M III MM III 

Db 928 GAGAAAAAACTTCCTTCCGATACAGAAAAAGAGGACAGATCACCATCTGCTATATTTTCA 987 

Qy 3157 GCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACT 3216 

I'll: IIMIIMI II hill! IIIMII 1 1 1 1 1 i I 1 1 . 1 1 1 ' I i 1 : 1 1 I 

Db 988 GCAGAGCTGAGTAAAACTTCAGTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACT 1047 

Qy 3217 GGAGTGGTGTTTGGTGCCAGCTTATTCCTGCTGCTGTCTCTGACAGTGTTCAGCATTGTC 3276 

IIIIIIIIIIIIIIMMIII Mllli llllll II Ml MIMIIMM 

Db 104 8 GGAGTGGTGTTTGGTGCCAGCCTATTCCAGCTGCTTTCATTGACAGTATTCAGCATTGTG 1107 

Qy 3277 AGTGTAACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGGATATAT 3336 

II Mill II IMIM llllll iriill Mill Ml Ml II IN! 

Db 1108 AGCGTAACAGCCTACATTGCCTTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATATAC 1167 

Qy 3337 AAGGG CGTGATCCAGG CTAT CCAGAAATCAGATGAAGGC CACC CATT CAGGGCATATTTA 3396 

MMI 1 1 1 S I ' 1 1 1 1 i : 1 1 1 : 1 1 1 1 1 1 1 1 : 1 1 1 1 1 . 1 1 i 1 1 1 1 1 1 1 1 I 

Db 1168 AAGGGTGTGATCCAAGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGGCATATCTG 1227 

Qy 3397 GAATCTGAAGTTGCTATATCAGAGGAATTGGTTCAGAAATACAGTAATTCTGCTCTTGGT 3456 

I II ill III MM MMI MM MMI II II I M 1 1 II 1 1 1 II 1 1 1 1 1 1 1 

Db 1228 GAATCTGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTTGGT 128 7 

Qy 3457 CATGTGAACAGCACAATAAAAGAACTGAGGCGGCTTTTCTTAGTTGATGATTTAGTTGAT 3516 

h II M MM Mill Mill II II I II MMIM IMIM IMIM 

Db 12 88 CATGTGAACTGCACGATAAAGGAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGAT 1347 

Qy 3517 TCCCTGAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTCAATGGT 3576 

II III MMMIMMMMMIMM Mill MMMMIMIIMM llllll 

Db 1348 TCTCTGGAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGT 14 07 

Qy 3577 CTGACACTACTGATTTTAGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTATGAACGG 363 6 

MMMMIMIIMM Mill II I MMIM II 1 1 1 II II I II I II II II I 

Db 14 08 CTGACACTACTGATTTTGGCTCTCATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGG 1467 

Qy 3637 CATCAGGTGCAGATAGATCATTATCTAGGACTTGCAAACAAGAGTGTTAAGGATGCCATG 3696 

MMIM 1 1 1 1 1 1 1 1 1 ; 1 1 1 1 1 1 1 1 ; 1 1 1 1 1 ! I MM llllll Mill III 

Db 1468 CATCAGGCACAGATAGATCATTATCTAGGACTTGCAAATAAGAATGTTAAAGATGCTATG 1527 

Qy 3697 G CCAAAATC CAAG CAAAAATCC CTGGATTGAAG CG CAAAGCAGA 3 74 0 

II M MUM IMIM II llllll IMIIMI II 

Db 1528 G CTAAAATC CAAG CAAAAATCC CTGGATTGAAG CG CAAAG CTGA 1571 



RESULT 8 

US-10-205-194-165 

; Sequence 165, Application US/10205194 



Publication No. US2 0030134301A1 
GENERAL INFORMATION: 
APPLICANT: Warner-Lambert Company 
APPLICANT: Lee, Kevin 
APPLICANT: Dixon, Alistair 
APPLICANT: Brooksbank, Robert 
APPLICANT: Pinnock, Robert 

TITLE OF INVENTION: Identification and Use of Molecules Implicated in Pain 
FILE REFERENCE: WL-A-018201 

CURRENT APPLICATION NUMBER: US/ 10/2 05 , 1 94 
CURRENT FILING DATE: 5200-07-24 
PRIOR APPLICATION NUMBER : GB 0118354.0 
PRIOR FILING DATE: 2001-07-27 
NUMBER OF SEQ ID NOS : 177 
SOFTWARE: Patent In Ver . 2.1 
SEQ ID NO 165 
LENGTH: 2782 
TYPE : DNA 

ORGANISM: Rattus norvegicus 
FEATURE : 

OTHER INFORMATION: Foocen-m2 reticulon 
US-10-205-194-165 

Query Match 21.6%; Score 809.8; DB 13; Length 2782; 

Best Local Similarity 99.8%; Pred. No. 1.7e-197; 

Matches 811; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 
Qy 14 G CGG CGG CGG CGG CTG CAG C CTGGGACAGGG CGGGTGG CACATCT CGATCG CGAAGG CAG 73 

Mill MMM M IIMII MMM IMMiill MMM IMIIIIM I 

Db 462 GCGGCGGCGGCGGCTGCAGCCTGGGACAGGGCGGGTGGCACATCTCGATCGCGAAGGCAG 521 

Qy 74 CAGAAGCAGTCTCATTGTTCCGGGAGCCGTCGCCTCTGCAGGTTCTTCGGCTCGGCTCGG 133 

I I I I I I I II I I I I I M I I I I I I I I I I I I I I I II I M I I I I II I M I I I I I I I I I I 1 I I I 

Db 522 GAGAAGCAGTCTCATTGTTCCGGGAGCCGTCGCCTCTGCAGGTTCTTCGGCTCGGCTCGG 581 

Qy 134 (^CGACTCGGCCTGCCTGGCCCCTGCCAGTCTTGCCCAACCCCCACAACCGCCCGCGACT 193 

I'lll II IIMI III Mill III MINI 1 1 II I II III 1 1 III II I II III II I 

Db 582 CACGACTCGGCCTGCCTGGCCCCTGCCAGTCTTGCCCAACCCCCACAACCGCCCGCGACT 641 

Qy 194 CTGAGGAGAAGCGGCCCTGCGGCGGCTGTAGCTGCAGCATCGTCGGCGACCCGCCAGCCA 253 

I IMM MMM IMIIIIM MMM MMM MIMIMM IIMII M 

Db 642 CTGAGGAGAAGCGGCCCTGCGGCGGCTGTAGCTGCAGCATCGTCGGCGACCCGCCAGCCA 701 

Qy 254 TGGAAGACATAGACCAGTCGTCGCTGGTCTCCTCGTCCACGGACAGCCCGCCCCGGCCTC 313 

i I 1 1 1 1 1 1 - ! ! I ! i I : I ; M 1 1 ! : I i 1 1 i I 1 1 1 i I i 1 1 1 1 1 , M I < I i I i 1 1 1 1 

Db 702 TGGAAGACATAGACCAGTCGTCGCTGGTCTCCTCGTCCACGGACAGCCCGCCCCGGCCTC 761 

Qy 314 CGCCCGCCTTCAAGTACCAGTTCGTGACGGAGCCCGAGGACGAGGAGGACGAGGAGGAGG 373 

I MMM MMM M IIMII MMM Illlll MIMMMMMMMM 

Db 762 CGCCCGCCTTCAAGTACCAGTTCGTGACGGAGCCCGAGGACGAGGAGGACGAGGAGGAGG 821 

Qy 374 AGGAGGACGAGGAGGAGGACGACGAGGACCTAGAGGAACTGGAGGTGCTGGAGAGGAAGC 433 

I Illlll IllMI M M ill MMM Illlll MMMMMIMIMM 

Db 822 AGGAGGACGAGGAGGAGGACGACGAGGACCTAGAGGAACTGGAGGTGCTGGAGAGGAAGC 881 

Qy 434 CCGCAGCCGGGCTGTCCGCAGCTGCGGTGCCGCCCGCCGCCGCCGCGCCGCTGCTGGACT 4 93 

I Illlll IllMI M IIMII Illlll Illlll MMMMMMMIMM 



Db 8 82 CCGCAGCCGGGCTGTCCGCAGCTGCGGTGCCGCCCGCCGCCGCCGCGCCGCTGCTGGACT 941 

Qy 4 94 TCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGCCGGCCGCGCCCCCTGCCG 553 

MM II MMI II III II IM II IIMM !; MUM MIMIM 

Db 942 TCAGCAGCGACTCGGTGCCCCCCGCGCCCCGCGGGCCGCTGCCGGCCGCGCCCCCTGCCG 1001 

Qy 554 CTCCTGAGAGGCAGCCATCCTGGGAACGCAGCCCCGCGGCGCCCGCGCCATCCCTGCCGC 613 

IIMIMI IMMMM IMMMIIIIM MIMMI IIMM llllll M 

Db 1002 CTCCTGAGAGGCAGCCATCCTGGGAACGCAGCCCCGCGGCGCCCGCGCCATCCCTGCCGC 1061 

Qy 614 CCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAGAGGACGACGAGCCTCCGGCGAGGCCCC 673 

M IIMM Mill! II MUM llllll IMMMM llllll MIMIM 

Db 1062 CCGCTGCCGCAGTCCTGCCCTCCAAGCTCCCAGAGGACGACGAGCCTCCGGCGAGGCCCC 1121 

Qy 674 CGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGGCGGAGCCCGCCGCGCCCCCTTCCACGC 733 

MIMIM II III IIIIMMIIIIMIII III MM llllllllllllllllllllll I 

Db 1122 CGCCTCCGCCGCCAGCCGGCGCGAGCCCCCTGGCGGAGCCCGCCGCGCCCCCTTCCACGC 1181 

Qy 734 CGGCCGCGCCCAAGCGCAGGGGCTCCGGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTG 7 93 

I i I I i I I I ': I I I I ! I I I ^ : I I I I I I I I I I I I I i ; I I 1 I I I I I I I I ! I I i I . : I 

Db 1182 CGGCCGCGCCCAAGCGCAGGGGCTCCGGCTCAGTGGATGAGACCCTTTTTGCTCTTCCTG 1241 

Qy 794 CTGCATCTGAGCCTGTGATACCCTCCTCTGCAG 826 

Milium i 1 1 1 1 i 

Db 1242 CTGCATCTGAACCTGTGATACCCTCCTCTGCAG 1274 



RESULT 9 
US-09-789-386-5 

Sequence 5, Application US/09789386 
Patent No. US20020010324A1 
GENERAL INFORMATION: 
APPLICANT: MICHALOVICH, DAVID 
APPLICANT: PRINJHA, RABINDER KUMAR 
TITLE OF INVENTION: NOVEL COMPOUNDS 
FILE REFERENCE: GP-30165-C1 

CURRENT APPLICATION NUMBER: US/09/789 , 386 
CURRENT FILING DATE: 2001-02-21 
PRIOR APPLICATION NUMBER: U.K. 9916898.1 
PRIOR FILING DATE: 1999-07-19 
PRIOR APPLICATION NUMBER: U.K. 9816024.5 
PRIOR FILING DATE: 1998-07-22 
PRIOR APPLICATION NUMBER: US 09/359,208 
PRIOR FILING DATE: 1999-07-22 
NUMBER OF SEQ ID NOS : 6 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 5 
LENGTH: 1122 
TYPE: DNA 

ORGANISM: HOMO SAPIENS 
US-09-789-386-5 



Query Match 13.3%; Score 4 97.4; DB 9 

Best Local Similarity 92.7%; Pred. No. 3.4e-117 
Matches 522; Conservative 0; Mismatches 41 



Length 1122; 
Indels 0; Gaps 



Qy 3178 GTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 3237 



i IM I IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 556 GTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 615 

Qy 3238 TTATTCCTGCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACATTGCC 3297 

II : II II i I I I II Mill Illlllllllll 

Db 616 CTATTCCTGCTGCTTTCATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCC 675 

Qy 32 98 TTGGCCCTGCTCTCGGTGACTATCAGCTTTAGGATATATAAGGGCGTGATCCAGGCTATC 3357 

Mil! Mill I I I I I I Mill Illlllll llllll 

Db 676 TTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATATACAAGGGTGTGATCCAAGCTATC 735 

Qy 3358 CAGAAATCAGATGAAGGCC^CCCATTCAGGGCATATTTAGAATCTGAAGTTGCTATATCA 3417 

IIIIIIIMIIIIIIIIIMIIIIIIIIIIIMIII ! IIIMIIIMIIIIIIIIII 

Db 736 CAGAAATCAGATGAAGG C CACCCATT CAGGGCATAT CTGGAATCTGAAGTTGCTATATCT 795 

Qy 3418 GAGGAATTGGTTCAGAAATACAGTAATTCTGCTCTTGGTCATGTGAACAGCACAATAAAA 3477 

Mill lllllllllll II llllll IMIM MM, MM 1 1 1 1 Mill 

Db 796 GAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAG 855 

Qy 3478 GAACTGAGGCGGCTTTTCTTAGTTGATGATTTAGTTGATTCCCTGAAGTTTGCAGTGTTG 3537 

Mill Mill II I I II I I I ,i MMMMIMMIMM 

Db 856 GAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 915 

Qy 3 53 8 ATGTGGGTGTTTACTTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTGATTTTAGCT 3597 

Illlllll Mill IIIIIIIIIIIIIIIM MMMMIIMIMMMIMI III 

Db 916 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCT 975 

Qy 3 598 CTGATCTCACTCTTCAGTATTCCTGTTATTTATGAACGGCATCAGGTGCAGATAGATCAT 3657 

II II IMMMIMM MMMMIIMMMMMMIMM MIMMIMM 

Db 976 CTCATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCACAGATAGATCAT 1035 

Qy 3658 TATCTAGGACTTG CAAACAAGAGTGTTAAGGATGCCATGG CCAAAAT CCAAG CAAAAATC 3717 

lllllllllllllllll MM llllll Mill Mill MMMMIMMIMM 

Db 103 6 TATCTAGGACTTG CAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAG CAAAAATC 10 95 

Qy 3 718 C CTGGATTGAAG CG CAAAG CAGA 374 0 

MIMMIMM 1 1 

Db 1096 CCTGGATTGAAGCGCAAAGCTGA 1118 



RESULT 10 
US-10-175-523-156' 

Sequence 156, Application US/10175523 
Publication No. US20030096264A1 
GENERAL INFORMATION: 
APPLICANT: Brockman, Jeffrey 
APPLICANT: Evans, David 
APPLICANT: Hook, Derek 
APPLICANT: Klimczak, Leszek 
APPLICANT: Laeng, Pascal 
APPLICANT: Palfreyman, Michael 
APPLICANT: Rajan, Prithi 

TITLE OF INVENTION: MULT I - PARAMETER HIGH THROUGHPUT SCREENING ASSAYS (MPHTS) 
FILE REFERENCE: 3235/1 J795 -US3 
CURRENT APPLICATION NUMBER: US/10/175 , 523 
CURRENT FILING DATE: 2002-06-18 
PRIOR APPLICATION NUMBER: US 60/299,151 



PRIOR FILING DATE: 2001-06-18 
PRIOR APPLICATION NUMBER: US 60/317,828 
PRIOR FILING DATE: 2001-09-07 
PRIOR APPLICATION NUMBER: US 60/325,150 
PRIOR FILING DATE: 2001-09-25 
PRIOR APPLICATION NUMBER: US 60/333,047 
PRIOR FILING DATE: 2001-11-14 
PRIOR APPLICATION NUMBER: US 60/349,936 
PRIOR FILING DATE: 2002-01-18 
PRIOR APPLICATION NUMBER: US 60/361,834 
PRIOR FILING DATE: 2002-03-04 
NUMBER OF SEQ ID NOS : 197 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 156 

LENGTH: 1160 

TYPE: DNA 



US- 


ORGANISM: 
10-175-523- 


Homo sapiens 
-156 




Query Match 13.3%; Score 4 97.4; DB 15; Length 1160; 
Best Local Similarity 92.7%; Pred. No. 3.5e-117; 

Matches 522; Conservative 0; Mismatches 41; Indels 0; Gaps 


0; 


Qy 


3178 


GTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 

IIIIIIIIIIIMI 

GTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 


3237 


Db 


228 


287 


Qy 


3238 


TTATTCCTGCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACATTGCC 

IIIIMMIIIII II lllllll i li'l II Mill Mil 

CTATTCCTGCTGCTTTCATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCC 


3297 


Db 


288 


347 


Qy 


3298 


TTGG CCCTG CT CT CGGTGACTATCAG CTTTAGGATATATAAGGG CGTGATCCAGGCTATC 

II 1 1. 1 Mill 1 1 1 II T Mill IIMIIII MUM 

TTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATATACAAGGGTGTGATCCAAGCTATC 


3357 


Db 


348 


407 


Qy 


3358 


CAGAAATCAGATGAAGGC CACC CATT CAGGGCATATTTAGAAT CTGAAGTTG CTATATCA 

IIIIIMMIIMIIIIIMMIIIIIIIIIIIIII 1 1 M 1 1 1 II M M 11 1 1 1 1 1 1 

(^GAAATCAGATGAAGGCC^CCCATTCAGGGCATATCTGGAATCTGAAGTTGCTATATCT 


3417 


Db 


408 


467 


Qy 


3418 


GAGGAATTGGTTCAGAAATACAGTAATTCTGCTCTTGGTCATGTGAACAGCACAATAAAA 

Mill MIMMIMI IMIMIMIMMMMMIIMIIMM MM Mill 

GAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAG 


3477 


Db 


468 


527 


Qy 


3478 


GAACTGAGGCGGCTTTTCTTAGTTGATGATTTAGTTGATTCCCTGAAGTTTGCAGTGTTG 

Lllll Mill II IIIIIIIIIIIIIIIMIIIIMIM IIIIIMIIIIIIIIIII 

GAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 


3537 


Db 


528 


587 


Qy 


3538 


ATGTGGGTGTTTACTTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTGATTTTAGCT 

1 Mill MMMMIMMIMI 1 li II 1 '1 1 1 III 

ATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCT 


3597 


Db 


588 


647 


Qy 


3598 


CTGATCTCACTCTTCAGTATTCCTGTTATTTATGAACGGCATCAGGTGCAGATAGATCAT 


3657 


Db 


648 


II II llllllllllll llllllllllll 
CTCATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCACAGATAGATCAT 


707 


Qy 


3658 


TATCTAGGACTTGCAAACAAGAGTGTTAAGGATGCCATGGCCAAAATCC^GCAAA 

MM llllll MM MM MM Mill lllll 1 1 1 M II II 1 1 1 II II 1 1 


3717 



Db 708 TATCTAGGACTTGCAAATAAGAATGTTAAAGATGCTATGGCTAAAATCCAAGCAAAAATC 767 



Qy 3 718 CCTGGATTGAAGCGCAAAGCAGA 374 0 

Mi: 1 ,. MINIMI Mi II 

Db 7 68 C CTGGATTGAAG CG CAAAG CTGA 790 



RESULT 11 
US-10-439-388-62 

; Sequence 62, Application US/10439388 

; Publication No. US2003 0228617A1 

; GENERAL INFORMATION: 

; APPLICANT : Aune, Thomas M 

; APPLICANT: 01 sen, Nancy J 

TITLE OF INVENTION: Method for Predicting Autoimmune Disease 
; FILE REFERENCE: 1242/68 

; CURRENT APPLICATION NUMBER: US/10/439 , 388 
CURRENT FILING DATE: 2003-05-16 
PRIOR APPLICATION NUMBER: US 60/381,055 
PRIOR FILING DATE: 2002-05-16 
; NUMBER OF SEQ ID NOS : 70 

SOFTWARE: Patent In version 3.2 
; SEQ ID NO 62 

LENGTH: 1785 
TYPE: DNA 



us- 


ORGANISM: 
10-439-388- 


Homo sapiens 
-62 




Query Match 13.3%; Score 497.4; DB 12; Length 1785; 
Best Local Similarity 92.7%; Pred. No. 4.8e-117; 

Matches 522; Conservative 0; Mismatches 41; Indels 0; Gaps 


0; 


Qy 


3178 


GTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 

MIMI M 1 M MMMMMIM IMIMM IMIIMI M IMM 

GTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 


3237 


Db 


247 


306 


Qy 


3238 


TTATTCCTGCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACATTGCC 

MIIMI MM II MM lllllllllll II Mill III Mill 

CTATTCCTGCTGCTTTCATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCC 


3297 


Db 


307 


366 


Qy 


3298 


TTGGCCCTGCTCTCGGTGACTATCAGCTTTAGGATATATAAGGGCGTGATCCAGGCTATC 

llllllllllllll Mill IMIIMI MIIMI HIM MMMM MMM 

TTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATATACAAGGGTGTGATCCAAGCTATC 


3357 


Db 


367 


426 


Qy 


3358 


CAGAAATCAGATGAAGGCCACCQVTTCAGGGCATATTTAGAATCTGAAGTTGCTATATCA 

IIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIII 1 

CAGAAATCAGATGAAGG C CAC CCATTCAGGG CATAT CTGGAAT CTGAAGTTGCTATATCT 


3417 


Db 


427 


486 


Qy 


3418 


GAGGAATTGGTT(JAGAAATACAGTAATTCTGCTCTTGGT(^TGTGAACAGCACAATAA^ 

Mill MMMM: M 1 1 1 M 1 II 1 1 1 1 M II 1 M 1 1 1 M II M MM Mill 

GAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTTGGTCATGTGAACTGCACGATAAAG 


3477 


Db 


487 


546 


Qy 


3478 


GAACTGAGGCGGCTTTTCTTAGTTGATGATTTAGTTGATTCCCTGAAGTTTGCAGTGTTG 

Mill Mill II Mill MMMM MM M M MM MMMM MIMI 

GAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 


3537 


Db 


547 


606 


Qy 


3538 


ATGTGGGTGTTTACTTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTGATTTTAGCT 


3597 



IIIIIIII Mill hi, I I 'I Ml'! 'Ml I I: i Ml 

Db 607 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCT 666 

Qy 3598 CTGATCTCACTCTTCAGTATTCCTGTTATTTATGAACGGCATCAGGTGCAGATAGATCAT 3657 

II II llllllllllll IMMI IIIIIIIIIIIIII IIIIIM IMMIIIMM 

Db 667 CTCATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCACAGATAGATCAT 726 

Qy 3658 TAT CTAGGACTTG CAAACAAGAGTGTTAAGGATGCCATGG CCAAAATCCAAG CAAAAATC 3717 

IIIIIM MM Illlll Mill Mill M I I I MM 

Db 727 TATCTAGGACTTG CAAATAAGAATGTTAAAGATG CTATGG CTAAAATCCAAG CAAAAATC 786 

Qy 3718 CCTGGATTGAAGCG CAAAG CAGA 374 0 

1 1 1 1 1 1 1 ! I ! 1 1 1 ' . 1 1 1 1 II 

Db 787 CCTGGATTGAAGCGCAAAGCTGA 8 09 



RESULT 12 
US-10-060-036-54 

Sequence 54, Application US/10060036 
Publication No. US20030073144A1 
GENERAL INFORMATION: 
APPLICANT: Benson, Darin R. 
APPLICANT: Kalos, Michael D. 
APPLICANT: Lodes , Michael J . 
APPLICANT: Persing, David H. 
APPLICANT: Hepler, William T. 
APPLICANT: Jiang, Yuqiu 

TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE THERAPY 
TITLE OF INVENTION: AND DIAGNOSIS OF PANCREATIC CANCER 
FILE REFERENCE: 210121.566 

CURRENT APPLICATION NUMBER: US/10/060,036 
CURRENT FILING DATE: 2002-01-30 
NUMBER OF SEQ ID NOS : 4560 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 54 
LENGTH: 223 5 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-060-036-54 

Query Match 13.3%; Score 4 97.4; DB 15; Length 223 5; 

Best Local Similarity 92.7%; Pred. No. 5.7e-117; 

Matches 522; Conservative 0; Mismatches 41; Indels 0; Gaps 0; 
Qy 3178 GTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 323 7 

II Mill IIIIIM III 1 1 Ml II I Mill 1 1 1 1 1 1 1 1 M 1 1 1 1 lllllllllllll II 

Db 697 GTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 756 

Qy 3238 TTATTCCTGCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACATTGCC 32 97 

lllllllllllll II IIIIIM Illllllllll II Mill llllllllllll 

Db 757 CTATTCCTGCTGCTTTCATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCC 816 

Qy 32 98 TTGGCCCTGCTCTCGGTGACTATCAGCTTTAGGATATATAAGGGCGTGATCCAGGCTATC 33 57 

IIIIIIIIIIIIII Mill IIIIIMIIIMIIII! Mill IIIIIIII Illlll 

Db 817 TTGGCCCTGCTCTCTGTGACCATCAGCTTTAGGATATACAAGGGTGTGATCCAAGCTATC 87 6 



Qy 



3358 CAGAAATCAGATGAAGGCCACCCATTCAGGGCATATTTAGAATCTGAAGTTGCTATATCA 3417 



: i I ;. ' : i ii I i i : i i mmmmmmmmimi 

7 7 CAGAAAT CAGATGAAGG CCACCCATTCAGGG CATATCTGGAATCTGAAGTTGCTATAT C 



Db 


877 


CAGAAATCAGATGAAGGCCACCCATTCAGGGCATATC^ 1 L 1 


y jo 


Qy 


3418 


GAGGAATTGGTTCAGAAATACAGTAATT CTG CTCTTGGTCATGTGAACAGCACAATAAAA 

Mill Illllllllll 1 1 II M Ml Ml 1 II 1 1 1 1 1 III Ml 1 M 1 MM Mill 

GAGGAGTTGGTTCAGAAGTACAGTAATTCTG CTCTTGGT CATGTGAACTG CACGATAAAG 


J4 / / 


Db 


937 


n n f 


Qy 


3478 


GAACTGAGGCGGCTTTTCTTAGTTGATGATTTAGTTGATTCCCTGAAGTTTGCAGTGTTG 

Mill Mill II MMMMMMMIMIMMIMI 1 

GAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 


~) C O 

353 / 


Db 


997 


1056 


Qy 


3538 


ATGTGGGTGTTTACTTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTGATTTTAGCT 


3 59/ 


Db 


1057 


Illlllll Mill lllllllllllllllll 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 E 1 1 1 ( 

ATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCT 


1116 


Qy 


3598 


CTGATCTCACTCTTCAGTATTCCTGTTATTTATGAACGGCATCAGGTGCAGATAGATCAT 


3657 


Db 


1117 


II II MIMMIMM III III MUM llllllllll Mill llllllllllll 

CTCATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCACAGATAGATCAT 


1176 


Qy 


3658 


TATCTAGGACTTGCAAACAAGAGTGTTAAGGATGCCATGGCCAAAATCCAAGCAAA^ 

lllllllllllllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TAT CTAGGACTTGCAAATAAGAATGTTAAAGATGCTATGG CTAAAATC CAAG CAAAAAT C 


3717 


Db 


1177 


1236 


Qy 


3718 


C CTGGATTGAAG CG CAAAG CAGA 3 74 0 

IIIIIIIMIIIIIIMIII II 

C CTGGATTGAAG CG CAAAG CTGA 1259 




Db 


1237 




RESULT 
US-09- 


13 

765-205 


-5 





Sequence 5, Application US/09765205 
Patent No. US2 0020034800A1 



; GENERAL INFORMATION: 
; APPLICANT: Cao, Li 

; TITLE OF INVENTION: BONE MARROW SECRETED PROTEINS AND POLYNUCLEOTIDES 

; FILE REFERENCE: 1458.004/200130.449 

; CURRENT APPLICATION NUMBER: US/09/765 , 205 

; CURRENT FILING DATE: 2001-01-17 

; PRIOR APPLICATION NUMBER: US/09/212 , 440 

PRIOR FILING DATE: 1998-12-16 
; NUMBER OF SEQ ID NOS : 4 6 

SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 5 

LENGTH: 1610 

TYPE : DNA 

ORGANISM: human 
US-09-765-205-5 

Query Match 13.3%; Score 495.8; DB 9; Length 1610; 

Best Local Similarity 92.5%; Pred. No. 1.2e-116; 

Matches 521; Conservative 0; Mismatches 42; Indels 0; Gaps 0; 



Qy 3178 GTTGTTGACCTCCTCTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 3237 

Illlllllllllll MM IMIMII MM IMMIII III MM MUM 

Db 687 GTTGTTGACCTCCTGTACTGGAGAGACATTAAGAAGACTGGAGTGGTGTTTGGTGCCAGC 74 6 

Qy 3238 TTATTCCTGCTGCTGTCTCTGACAGTGTTCAGCATTGTCAGTGTAACGGCCTACATTGCC 32 97 



MINIM' II M Ml IIMIIIIIII II Mill Mil MUM 

Db 74 7 CTATTCCTGCTGCTTTCATTGACAGTATTCAGCATTGTGAGCGTAACAGCCTACATTGCC 806 

Qy 32 98 TTGGCCCTGCTCTCGGTGACTATCAGCTTTAGGATATATAAGGGCGTGATCCAGGCTATC 3357 

Illlllllllllll Mill IMMMIMMIMM Mill Illlllll llllll 

Db 807 TTGGCCCTGCTCTCTGTGACCATCA.GCTTTAGGATATACAAGGGTGTGATCCAAGCTATC 866 

Qy 3358 CAGAAATCAGATGAAGGCCACCCATTCAGGGCATATTTAGAATCTGAAGTTGCTATATCA 3417 

1 1 1 1 1 1 1 1 1 1 1 1 1 M I II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II I IIMIIIIIII II Mill 

Db 8 67 CAGAAATCAGATGAAGG CCAC CCATTCAGGG CATAT CTGGAAT CTGAAGTTG CTATATCT 926 

Qy 3418 GAGGAATTGGTTCAGAAATAC^GTAATTCTGCTCTTGGT(^TGTGAACAGCACAATAAAA 3477 

Mill MIMIIMM llllllllllllllllllllllllll Ml MM Mill 

Db 927 GAGGAGTTGGTTCAGAAGTACAGTAATTCTGCTCTTGGTCATGTCAACTGCACGATAAAG 98 6 

Qy 3478 GAACTGAGGCGGCTTTTCTTAGTTGATGATTTAGTTGATTCCCTGAAGTTTGCAGTGTTG 3537 

Mill Mill II MMIIMIIIIIMIIIIIIIIIM 1 1 1 1 1 1 1 1 M 1 1 II 1 1 1 1 

Db 98 7 GAACTCAGGCGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCTGAAGTTTGCAGTGTTG 104 6 

Qy 353 8 ATGTGGGTGTTTACTTATGTTGGTGCCTTGTTCAATGGTCTGACACTACTGATTTTAGCT 3 597 

Illlllll Mill lllllllllllllllll IIIIIIIIIIIMIIIIIIIIII III 

Db 1047 ATGTGGGTATTTACCTATGTTGGTGCCTTGTTTAATGGTCTGACACTACTGATTTTGGCT 1106 

Qy 3598 CTGATCTCACTCTTCAGTATTCCTGTTATTTATGAACGGCATCAGGTGCAGATAGATCAT 3657 

M M I i 1 1 1 1 1 1 i 1 1 i MMiMMMMMMMlllMlll 

Db 1107 CTCATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCAGGCACAGATAGATCAT 1166 

Qy 3658 TATCTAGGACTTG CAAACAAGAGTGTTAAGGATG C CATGG CCAAAATC CAAG CAAAAATC 3717 

Mill Mill II I MM MM MUM Mill Mill MUM M llllll 

Db 1167 TATCTAGGACTTG CAAATAAGAATGTTAAAGATG CTATGG CTAAAATC CAAG CAAAAATC 122 6 

Qy 3718 C CTGGATTGAAG CG CAAAG CAGA 3 74 0 

MMMMMMIMIMM II 
Db 1227 C CTGGATTGAAG CG CAAAG CTGA 124 9 



RESULT 14 
US-09-789-386-3 

; Sequence 3, Application US/09789386 

; Patent No, US20020010324A1 

; GENERAL INFORMATION: 

; APPLICANT: MICHALOVICH, DAVID 

; APPLICANT: PRINJHA, RABINDER KUMAR 

; TITLE OF INVENTION: NOVEL COMPOUNDS 

FILE REFERENCE: GP-30165-C1 
; CURRENT APPLICATION NUMBER: US/09/78 9,386 
; CURRENT FILING DATE: 2001-02-21 

PRIOR APPLICATION NUMBER: U.K. 9916898.1 
; PRIOR FILING DATE: 1999-07-19 

PRIOR APPLICATION NUMBER: U.K. 9816024.5 

PRIOR FILING DATE: 1998-07-22 
; PRIOR APPLICATION NUMBER: US 09/359,208 
; PRIOR FILING DATE : 1999-07-22 
; NUMBER OF SEQ ID NOS : 6 

SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 3 

LENGTH: 868 



TYPE: DNA 

ORGANISM: HOMO SAPIENS 
FEATURE : 

NAME /KEY: UNSURE 
LOCATION: (91) (413) 
US-09-789-386-3 

Query Match 12.9%; Score 483.6; DB 9; Length 868; 

Best Local Similarity 76.8%; Pred. No. le-113; 

Matches 668; Conservative 0; Mismatches 166; Indels 36; Gaps 5; 



Qy 830 AAATTATGGATTTGATGGAGCAGCCAGGTAACACTGTTTCGTCTGGTCAAGAGGATTTCC 88 9 

III MINI 1 1 1 1 I! Illlll II MM Mill I II llllll IIMi 

Db 2 AAAATATGGACTTGAAGGAG CAG C CAGGTAACACTATTT CGG CTGGT CAAGAGGATTTCC 61 

Qy 890 CATCTGTCCTGCTTGAAACTGCTGCCTCTCTTCCTTCTCTATCTCCTCTCTCAACTGTTT 94 9 

IIIMIIIIIIIIIIIIIIIIIIII Ml IIMIIMM 1 1 1 1 1 1 1 1 1 1 i 1 I I II 

Db 62 CATCTGTCCTGCTTGAAACTGCTGCTTCTNTTCCTTCTCTGTCTCCTCTCTCAGCCGCTT 121 

Qy 950 CTTTTAAAGAACATGGATACCTTGGTAACTTATCAGCAGTGTCATCCTCAGAAGGAACAA 1009 

I I I I I 3 I I I I I llllllllllll II III MM Mill llllll 

Db 122 CTTTCAAAGAACATGAATACCTTGGTAATTTGTCAACAGTATTACCCACTGAAGGAACAC 181 

Qy 1010 TTGAAGAAACTTTAAATGAAGCTTCTAAAGAGTTGCC^GAGAGGGCAACAAATCCATTTG 1069 

II MM | I I IIIIIIIIIIIIIMI I llllll Mill II II I I 

Db 182 TTC^GAAAATGTCAGTGAAGCTTCTAAAGAGGTCTCAGAGAAGGCAAAAACTCTACTCA 241 

Qy 107 0 TAAATAGAGATTTAG CAGAATTTTCAGAATTAGAATATTCAGAAATGGGATCAT CTTTTA 112 9 

ii Mini;: mm mmmmimmimi mimmmmiimm m i 

Db 242 TAGATAGAGATTTAACAGAGTTTTCAGAATTAGAATACTCAGAAATGGGATCATCGTTCA 3 01 

Qy 113 0 AAGGCTCCCCAAAAGGAGAGTCAGCCATATTAGTAGAAAACACTAAGGAAGAAGTAATTG 118 9 

I III Illllll III II III II llllll III Ml Mill MM I 

Db 302 GTGTCTCTCCAAAAGCAGAATCTGCCGTAATAGTAGCAAATCCTAGGGAAGAAATAATCG 361 
Qy 1190 TGAGGAGTAAA GACAAAGAGGATTTAGTTTGTAGTGCAGCCCTTCACAGTCCACAAG 1246 

III I MM II Mill I llllll III I llllll I II IIMI 

Db 362 TGAAAAATAAAGATGAAGAAGAGAAGTTAGTTAGTAATAACATCCTT CATANT CAACAAG 421 
Qy 124 7 AATCACCT GTGGGTAAAGAAGACAGAGTTGTGTCTCCAGAAAAGA 12 91 

I 1 MM III llllll II llllllllll MM 

Db 422 AGTTACCTACAGCTCTTACTAAATTGGTTAAAGAGGATGAAGTTGTGTCTTCAGAAAAAG 481 

Qy 12 92 CAATGGACATTTTTAATGAAATG CAGATGT CAGTAGTAG CAC CTGTGAGGGAAGAGTATG 1351 

Ml MM I II MM I I MM I Ml III Illllll II MM 

Db 482 CAAAAGACAGTTTTAATGAAAAGAGAGTTG CAGTGGAAG CT C CTATGAGGGAGGAATATG 541 

Qy 1352 CAGACTTTAAG CCATTTGAACAAGCATGGGAAGTGAAAGATACTTATGAGGGAAGTAGGG 1411 

Illllll II MM I II MIMMMMMIMI I III I III I 

Db 54 2 CAGACTTCAAACCATTTGAGCGAGTATGGGAAGTGAAAGATA GTAAGGAAGATAGTG 5 98 

Qy 1412 ATGTGCTGGCTGCTAGAGCT AATGTGGAAAGTAAAGTGGACAGAAAAT 1459 

II II llllllll III I II MMMMIMIMM I IIMI 

Db 59 9 ATATGTTGGCTGCTGGAGGTAAAATCGAGAGCAACTTGGAAAGTAAAGTGGATAAAAAAT 658 

Qy 14 6 0 G CTTGGAAGATAG CCTGGAG CAAAAAAGTCTTGGGAAGGATAGTGAAGG CAGAAATGAGG 1519 

! II I MINIMI Illllll I II I II llllllll I II IIMI I 



Db 



65 9 GTTTTGCAGATAGCCTTGAGCAAACTAATCACGAAAAAGATAGTGAGAGTAGTAATGATG 718 



Qy 152 0 ATGCTTCTTTCCCCAGTACCCCAGAACCTGTGAAGGACAGCTCCAGAGCATATATTACCT 1579 

II I I I I : i I I I i I I I I III I I lllll I II Ml Mill II I 

Db 719 ATACTTCTTTCCCCAGTACGCCAGAAGGTATAAAGGATCGTTCAGGAGCATATATCACAT 778 
Qy 1580 GTGCTTCCTTTA CCTCAGCAACCGAAAGCACCACAGCAAACACTTTCCCTTTGTTAG 1636 

Mill MM I IMM II 1 1 1 1 1 1 II Ml MIMIIMI 

Db 77 9 GTGCTCCCTTTAACCCAGCAGCAACTGAGAGCATTGCAACAAACATTTTTCCTTTGTTAG 838 

Qy 1637 AAGATCATACTT CAGAAAATAAAACAGATG 1666 

Mill IMMIM Ml I II MM 

Db 83 9 GAGATC CTACTTCAGAAAATAAGAC CGATG 868 



RESULT 15 

US-09-960-352-8477 

Sequence 8477, Application US/09960352 
Patent No. US2002013713 9A1 
GENERAL INFORMATION: 
APPLICANT: Warren, Wesley C. 
APPLICANT: Tao, Nengbing 
APPLICANT: Byatt , John C. 
APPLICANT: Mathialagan, Nagappan 

TITLE OF INVENTION: NUCLEIC ACID AND OTHER MOLECULES ASSOCIATED WITH 
LACTATION AND 

TITLE OF INVENTION: MUSCLE AND FAT DEPOSITION 
FILE REFERENCE: 16511 . 006/37-21 ( 10298 ) C 
CURRENT APPLICATION NUMBER : US/09/960,352 
CURRENT FILING DATE: 2001-09-24 
NUMBER OF SEQ ID NOS : 15112 
SEQ ID NO 8477 
LENGTH: 422 
TYPE: DNA 

ORGANISM: Bos taurus 

OTHER INFORMATION: Clone ID: 3 6 -LI B34 - 04 8 -Ql -El -A8 
US-09-960-352-8477 

Query Match 10.0%; Score 375.6; DB 10; Length 422; 

Best Local Similarity 93.1%; Pred. No. 4.3e-86; 

Matches 393; Conservative 0; Mismatches 29; Indels 0; Gaps 0; 
Qy 3282 AACGGCCTACATTGCCTTGGCCCTGCTCTCGGTGACTATCAGCTTTAGGATATATAAGGG 3341 

MIMMMMMMMIMMIMIMM IIIIIIIIIIIIIIIMIIIIIIIIIIII 

Db 1 AACGGCCTACATTGCCTTGGCCCTGCTCTCTGTGACTATCAGCTTTAGGATATATAAGGG 60 

Qy 3342 CGTGATCCAGGCTATCCAGAAATCAGATGAAGGCCACCCATTCAGGGCATATT 34 01 

Ml IMIMM IMIMI 1 1 1 1 . 1 1 1 1 1 1 1 1 1 1 1 1 1 i I i 1 1 1 ! I ! lllll 

Db 61 TGTGATCCAGGCTATCCAGAAATCTGATGAAGGC(^CCCATTCAGGGCATATTTGGAATC 12 0 

Qy 34 02 TGAAGTTGCTATATCAGAGGAATTGGTTCAGAAATACAGTAATTCTGCTCTTGGTCATGT 34 61 

i : i : 1 1 1 1 1 1 1 1 1 1 mil iiiiiiiiiii inn 1 1 i r 1 1 1 1 1 1 i 1 1 1 1 1 1 1 ( i 

Db 121 TGAAGTTGCTATATCTGAGGAGTTGGTTCAGAAGTACAGCAATTCTGCTCTTGGTCATGT 18 0 

Qy 34 62 GAACAGCACAATAAAAGAACTGAGGCGGCTTTTCTTAGTTGATGATTTAGTTGATTCCCT 3521 

III MIIIMI III II I | II I I I I IM 111 I I I I I I I I I . I 1 I I II 

Db 181 TAACTGCACAATAAAAGAACTCAGACGCCTCTTCTTAGTTGATGATTTAGTTGATTCTCT 24 0 



Qy 3522 GAAGTTTGCAGTGTTGATGTGGGTGTTTACTTATGTTGGTGCCTTGTTCAATGGTCTGAC 3581 

Illlllll IHIIIMIIIMI Mill MilMIMIIIIIIMI llllllllll 

Db 241 GAAGTTTGCAGTGTTGATGTGGGTATTTACCTATGTTGGTGCCTTGTTCAATGGTCTGAC 3 00 

Qy 3582 ACTACTGATTTTAGCTCTGATCTCACTCTTCAGTATTCCTGTTATTTATGAACGGCATCA 3641 

MINI Mill Illlllll llllllllllll Mill MlllllllhlMIMIII 

Db 3 01 ACTACTAATTTTGGCTCTGATTTCACTCTTCAGTGTTCCTGTTATTTATGAACGGCATCA 360 

Qy 3642 GGTGCAGATAGATCATTATCTAGGACTTG CAAACAAGAGTGTTAAGGATG CCATGG CCAA 3701 

II III I II II I II 1 1 II 1 1 1 1 III 1 1 MM MM MUM Mill Mill II 

Db 361 GG CGCAAATAGATCATTAT CTGGGACTTG CAAATAAGAATGTTAAAGATG CTATGG CTAA 42 0 

Qy 3702 AA 3703 

II 

Db 421 AA 422 
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